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NUCLEIC ACID MOLECULES ENCODING TRANSMEMBRANE SERINE 
PROTEASES, THE ENCODED PROTEINS AND METHODS BASED THEREON 

RELATED APPLICATIONS 

Benefit of priority is claimed to U.S. provisional application Serial No. 
5 60/179,982, to Edwin L. Madison and Edgar O. Ong, filed February 3, 2000, 
entitled "NUCLEOTIDE AND PROTEIN SEQUENCES OF A TRANSMEMBRANE 
SERINE PROTEASE AND METHODS BASED THEREOF"; to U.S. provisional 
application Serial No. 60/183,542, to Edwin L. Madison and Edgar O. Ong, filed 
February 18, 2000, entitled "NUCLEOTIDE AND PROTEIN SEQUENCES OF A 

10 TRANSMEMBRANE SERINE PROTEASE AND METHODS BASED THEREOF"; to 
U.S. provisional application Serial No. 60/213,124, to Edwin L. Madison and 
Edgar O. Ong, filed June 22, 2000, entitled "NUCLEOTIDE AND PROTEIN 
SEQUENCES OF A TRANSMEMBRANE SERINE PROTEASE AND METHODS 
BASED THEREOF"; to U.S. provisional application Serial No. 60/220,970, to 

15 Edwin L. Madison and Edgar O. Ong, filed July 26, 2000, entitled 

"NUCLEOTIDE AND PROTEIN SEQUENCES OF A TRANSMEMBRANE SERINE 
PROTEASE AND METHODS BASED THEREOF"; and to U.S. provisional 
application Serial No. 60/234,840 to Edwin L. Madison, Edgar 0. Ong and Jiunn- 
Chern Yeh, filed September 22, 2000, entitled "NUCLEIC ACID MOLECULES 

20 ENCODING TRANSMEMBRANE SERINE PROTEASES, THE ENCODED PROTEINS 

* 

AND METHODS BASED THEREON" is claimed herein. Benefit of priority is also 
claimed to U.S. application Serial No. 09/657,968, to Edwin L. Madison, Joseph 
Edward Semple, Gary Samuel Coombs, John Eugene Reiner, Edgar O. Ong, Gian 
Luca Araldi, filed September 8, 2000, entitled "INHIBITORS OF SERINE 

25 PROTEASE ACTIVITY OF MATRIPTASE OR MTSP1 This application is a 
continuation-in-part of U.S. application Serial No. 09/657,986. 

This application is related to U.S. provisional application Serial No. 
60/166,391 to Edwin L. Madison and Edgar O. Ong, filed November 18, 1999 
entitled "NUCLEOTIDE AND PROTEIN SEQUENCES OF PROTEASE DOMAINS OF 

30 ENDOTHELIASE AND METHODS BASED THEREON". This a application is also 
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related to International PCT application No. PCT/US00/31 803, filed November 
17, 2000. 

Where permitted, the above-noted provisional applications', patent 
application and International PCT application are incorporated by reference in 
5 their entirety. All patents, applications, published applications and other 
publications and sequences from GenBank and other data bases referred to 
herein are incorporated by reference in their entirety. 
FIELD OF INVENTION 

Nucleic acid molecules that encode proteases and portions thereof, 
10 particularly protease domains are provided. Also provided are prognostic, 

diagnostic and therapeutic methods using the proteases and domains thereof and 
the encoding nucleic acid molecules. 

BACKGROUND OF THE INVENTION AND OBJECTS THEREOF 

Cancer a leading cause of death in the United States, developing in one in 

15 three Americans; one of every four Americans dies of cancer. Cancer is 

characterized by an increase in the number of abnormal neoplastic cells, which 
proliferate to form a tumor mass, the invasion of adjacent tissues by these 
neoplastic tumor cells, and the generation of malignant cells that metastasize via 
the blood or lymphatic system to regional lymph nodes and to distant sites. 

20 Among the hallmarks of cancer is a breakdown in the communication 

among tumor cells and their environment. Normal cells do not divide in the 
absence of stimulatory signals, cease dividing in the presence of inhibitory 
signals. Growth-stimulatory and growth-inhibitory signals, are routinely 
exchanged between cells within a tissue. In a cancerous, or neoplastic, state, a 

25 cell acquires the ability to "override" these signals and to proliferate under 
conditions in which normal cells do not grow. 

In order to proliferate tumor cells acquire a number of distinct aberrant 
traits reflecting genetic alterations. The genomes of certain well-studied tumors 
carry several different independently altered genes, including activated ■ 

30 oncogenes and inactivated tumor suppressor genes. Each of these genetic 

changes appears to be responsible for imparting some of the traits that, in the 
aggregate, represent the full neoplastic phenotype. 
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A variety of biochemical factors have been associated with different 
phases of metastasis. Cell surface receptors for collagen, glycoproteins such as 
laminin, and proteoglycans, facilitate tumor cell attachment, an important step in 
invasion and metastases. Attachment triggers the release of degradative 
5 enzymes which facilitate the penetration of tumor cells through tissue barriers. 
Once the tumor cells have entered the target tissue, specific growth factors are 
required for further proliferation. Tumor invasion (or progression) involves a 
complex series of events, in which tumor cells detach from the primary tumor, 
break down the normal tissue surrounding it, and migrate into a blood or 
10 lymphatic vessel to be carried to a distant site. The breaking down of normal 
tissue barriers is accomplished by the elaboration of specific enzymes that 
degrade the proteins of the extracellular matrix that make up basement 
membranes and stromal components of tissues. 

A class of extracellular matrix degrading enzymes have been implicated in 
15 tumor invasion. Among these are the matrix metalloproteinases (MMP). For 
example, the production of the matrix metalloproteinase stromelysin is 
associated with malignant tumors with metastatic potential (see, e.g., McDonnell 
etal. (1990) Smnrs. in Cancer Biology /:107-115; McDonnell et al. (1990) 
Cancer and Metastasis Reviews 5:309-31 9). 
20 The capacity of cancer cells to metastasize and invade tissue is facilitated 

by degradation of the basement membrane. Several proteinase enzymes, 
including the MMPs, have been reported to facilitate the process of invasion of 
tumor cells. MMPs are reported to enhance degradation of the basement 
membrane, which thereby permits tumorous cells to invade tissues. For 
25 example, two major metalloproteinases having molecular weights of about 70 
kDa and 92 kDa appear to enhance ability of tumor cells to metastasize. 
Type II Transmembrane Serine Proteases (TTSPs) 
In addition to the MMPs, serine proteases have been implicated in 
neoplastic disease progression. Most serine proteases, which are either 
30 secreted enzymes or are sequestered in cytoplasmic storage organelles, have 
roles in blood coagulation, wound healing, digestion, immune responses and 
tumor invasion and metastasis. A class cell surface proteins designated type II 



WO 01/57194 



PCTAJS01/03471 



-4- 



transmembrane serine proteases, which are membrane-anchored proteins with N- 
terminal extracellular domains, has been identified. As cell surface proteins, they 
are positioned to play a role in intracellular signal transduction and in mediating 
cell surface proteolytic events. 
5 Cell surface proteolysis is a mechanism for the generation of biologically 

active proteins that mediate a variety of cellular functions. These membrane- 
anchored proteins, include a disintegrin-like and metalloproteinase (ADAM) and 
membrane-type matrix metalloproteinase (MT-MMP). Jn mammals, at least 17 
members of the family are known, including seven in humans (see, Hooper et al. 
10 (2001 ) J. Biol. Chem. 276:857-860). These include: corin (accession nos. 
AF1 33845 and AB013874; see, Yan et al. (1999) J. Biol. Chem. 274:14926- 
14938; Tomia et al. (1998) J. Biochem. 724:784-789; Uan et al. (2000) Proc. 
Natl. Acad. Sci. U.S.A. 57:8525-8529); enterpeptidase (also designated 
enterokinase; accession no. U09860 for the human protein; see, Kitamoto et al. 
15 (1995) Biochem. 27: 4562-4568; Yahagi et al. (1996) Biochem. Biophys. Res. 
Commun. 2/5:806-812; Kitamoto et al. (1994) Proc. Natl. Acad. Sci. U.S.A. 
57:7588-7592; Matsushima et al. (1994) J. Biol. Chem. 265:19976-19982;); 
human airway trypsin-like protease (HAT; accession no. AB002134; see 
Yamaoka et al. J. Biol. Chem. 273:11894-11901); MTSP1 and matriptase (also 
20 called TADG-15; see SEQ ID Nos. 1 and 2; accession nos. 

AF133086/AF1 18224, AF04280022; Takeuchi et al. (1999) Proc. Natl. Acad. 
ScL U.S.A. 56:11054-1161; Lin et al. (1 999) J. Biol. Chem. 274/18231-18236; 
Takeuchi et al. (2000) J. Biol. Chem. 275:26333-26342; and Kim et al. (1999) 
fmmunogenetics 45:420-429); hepsin {see, accession nos. M18930, AF030065, 
25 X70900; Leytus et al. (1988) Biochem. 27: 1 1895-1 1901; Vu et al. (1997) J. 
Biol. Chem. 272:31315-31320; and Farley et al. (1993) Biochem. Biophys. Acta 
7/75:350-352; and see, U.S. Patent No. 5,972,616); TMPRS2 (see, Accession 
Nos. U75329 and AF1 13596; Paoloni-Giacobino et al. (1997) Genomics 44:309- 
320; and Jacquinet et al. (2000) FEBS Lett. 468: 93-100); and TMPRSS4 (see, 
30 Accession No. NM 016425; Wallrapp et al. (2000) Cancer 60:2602-2606). 

Serine proteases, including transmembrane serine proteases, have been 
implicated in processes involved in neoplastic development and progression. 
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While the precise role of these proteases has not been elaborated, serine 
proteases and inhibitors thereof are involved in the control of many intra- and 
extracellular physiological processes, including degradative actions in cancer cell 
invasion, metastatic spread, and neovascularization of tumors, that are involved 
5 in tumor progression. It is believed that proteases are involved in the 

degradation of extracellular matrix (ECM) and contribute to tissue remodeling, 
and are necessary for cancer invasion and metastasis. The activity and/or 
expression of some proteases have been shown to correlate with tumor 
progression and development. 
10 For example, a membrane-type serine protease MTSP1 (also called 

matriptase; see SEQ ID Nos. 1 and 2 from U.S. Patent No. 5,972,616; and 
GenBank Accession No. AF1 18224; (1999) J. Biol. Chem. 274:18231-18236; 
U.S. Patent No. 5,792,616; see, also Takeuchi (1999) Proc. Natl. Acad. Sci. 
U.S.A. 36:1 1054-1 161) that is expressed in epithelial cancer and normal tissue 
15 (Takeucuhi et al. (1999) Proc. Natl. Acad. Scl. USA, 96(20) :1 1054-61 ) has been 
identified. Matriptase was originally identified in human breast cancer cells as a 
major gelatinase (see, U.S. Patent No. 5,482,848), a type of matrix 
metalloprotease (MMP). It has been proposed that it plays a role in the 
metastasis of breast cancer. Its primary cleavage specificity is Arg-Lys residues. 
20 Matriptase also is expressed in a variety of epithelial tissues with high levels of 
activity and/or expression in the human gastrointestinal tract and the prostate. 

Prostate-specific antigen (PSA), a kallikrein-like serine protease, degrades 
extracellular matrix glycoproteins fibronectin and laminin, and, has been 
postulated to facilitate invasion by prostate cancer cells (Webber et al. (1995) 
25 Clin. Cancer Res., 1(101 :1089-94). Blocking PSA proteolytic activity with 

PSA-specific monoclonal antibodies results in a dose-dependent decrease in vitro 
in the invasion of the reconstituted basement membrane Matrigel by LNCaP 
human prostate carcinoma cells which secrete high levels of PSA. 

Hepsin, a cell surface serine protease identified in hepatoma cells, is 
30 overexpressed in ovarian cancer (Tanimoto et al. (1997) Cancer Res., 

571141:2884-7). The hepsin transcript appears to be abundant in carcinoma 
tissue and is almost never expressed in normal adult tissue, including normal 
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ovary. It has been suggested that hepsin is frequently overexpressed in ovarian 
tumors and therefore may be a candidate protease in the invasive process and 
growth capacity of ovarian tumor cells. 

A serine protease-like gene, designated normal epithelial cell-specific 1 
5 (NES1J (Liu et al., Cancer Res., 56(141 :3371-9 (1996)) has been identified. 
Although expression of the NES1 mRNA is observed in all normal and 
immortalized nontumorigenic epithelial cell lines, the majority of human breast 
cancer cell lines show a drastic reduction or a complete lack of its expression. 
The structural similarity of NES1 to polypeptides known to regulate growth 
10 factor activity and a negative correlation of NES1 expression with breast 

oncogenesis suggest a direct or indirect role for this protease-like gene product 
in the suppression of tumorigenesis. 

Hence transmembrane serine proteases appear to be involved in the 
etiology and pathogenesis of tumors. There is a need to further elucidate their 
15 role in these processes and to identify additional transmembrane proteases. 
Therefore, it is an object herein to provide transmembrane serine protease 
(MTSP) proteins and nucleic acids encoding such MTSP proteases that are 
involved in the regulation of or participate in tumorigenesis and/or 
carcinogenesis. It is also an object herein to provide prognostic, diagnostic, 
20 therapeutic screening methods using the such proteases and the nucleic acids 
encoding such proteases. 
SUMMARY OF THE INVENTION 

Provided herein are isolated protease domains of the Transmembrane 
Serine Protease family, particularly the Type II Transmembrane Serine Protease 
25 (TTSP) family (also referred to herein as MTSPs), and more particularly TTSP 
family members whose functional activity differs in tumor cells from non-tumor 
cells in the same tissue. For example, the MTSPs include those that are 
activated and/or expressed in tumor cells at different levels, typically higher, 
from non-tumor cells; and those from cells in which substrates therefor differ in 
30 tumor cells from non-tumor cells or otherwise alter the specificity of the MTSP. 

The MTSP family as intended herein does not include any membrane 
anchored or spanning proteases that are expressed on endothelial cells. 
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Included among the MTSPs are several heretofore unidentified MTSP family 
members, designated herein as MTSP3 and MTSP4 and a new form of a protein 
designated herein as MTSP6. In addition to the protease domains of each of 
MTSP3 and MTSP4, the full-length proteins, including those that results from 
5 splice variants, zymogens and activated forms, and uses thereof, are also 
provided. 

The protease domains as provided herein are single-chain polypeptides, 
with an N-terminus (such as IV, VV, IL and II) generated at the cleavage site 
(generally having the consensus sequence R^VVGG, RilVGG, R4IVNG, 

10 R1ILGG, RIVGLL, RilLGG or a variation thereof; an N-terminus R* V or Rll, 

where the arrow represents the cleavage point) when the zymogen is activated. 
To identify the protease domain an Rl should be identified, and then the 
following amino acids compared to the above noted motif. 

The protease domains generated herein, however, do not result from 

15 activation, which produces a two chain activated product, but rather are single 
chain polypeptides with the N-terminus include the consensus sequence 
I VVGG, IIVGG, 4 VGLL, I ILGG or IIVNG or other such motif at the N- 
terminus. As shown herein, such polypeptides, although not the result of 
activation and not double-chain forms, exhibit proteolytic (catalytic) activity. 

20 These protease domain polypeptides are used in assays to screen for agents that 
modulate the activity of the MTSP. Such assays are also provided herein. In 
exemplary assays, the affects of test compounds in the ability of a protease 
domains to proteolytically cleave a known substrate, typically a fluorescently, 
chromogenically or otherwise detectably labeled substrate, are assessed. 

25 Agents, generally compounds, particularly small molecules, that modulate the 
activity of the protease domain are candidate compounds for modulating the 
activity of the MTSP. The protease domains can also be used to produce single- 
chain protease-specific antibodies. The protease domains provided herein 
include, but are not limited to, the single chain region having an N-terminus at 

30 the cleavage site for activation of the zymogen, through the C-terminus, or C- 
terminal truncated portions thereof that exhibit proteolytic activity as a single- 
chain polypeptide in in vitro proteolysis assays, of any MTSP family member. 
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preferably from a mammal, including and most preferably human, that, for 
example, is expressed in tumor cells at different levels from non-tumor cells, and 
that is not expressed on an endothelial cell. These include, but are not limited to 
: MTSP1 (or matriptase), MTSP3, MTSP4 and MTSP6. Other MTSP protease 
5 domains of interest herein, particularly for use in in vitro drug screening 
proteolytic assays, include, but are not limited to: corin (accession nos. 
AF1 33845 and AB013874; see, Yan et at. (1999) J. Biol. Chem. 274:14926- 
14938; Tomia etal. (1998) J. Biochem. 724:784-789; Uan et aL (2000) Proc. 
Natl. Acad. ScL U.S.A. 57:8525-8529; SEQ ID Nos. 61 and 62 for the human 

10 protein); enterpeptidase (also designated enterokinase; accession no. U09860 
for the human protein; see, Kitamoto et aL (1995) Biochem. 27: 4562-4568; 
Yahagi et al. (1996) Biochem. Biophys. Res. Commun. 2/5:806-812; Kitamoto 
etal. (1994) Proc. Natl. Acad. ScL U.S.A. 57:7588-7592; Matsushima et al. 
(1994) J. Biol. Chem. 255:19976-19982; see SEQ ID Nos. 63 and 64 for the 

15 human protein); human airway trypsin-like protease (HAT; accession no. 

AB002134; see Yamaoka etal. J. BioL Chem. 273:1 1894-1 1901 ; SEQ ID Nos. 
65 and 66 for the human protein); hepsin (see, accession nos. M18930, 
AF030065, X70900; Leytus etal. (1988) Biochem. 27: 1 1895-1 1901; Vu etal. 
(1997)*/. Biol. Chem. 272:31315-31320; and Farley etal. (1993) Biochem. 

20 Biophys. Acta 7 775:350-352; SEQ ID Nos. 67 and 68 for the human protein); 
TMPRS2 (see, Accession Nos. U75329 and AF1 13596; Paoloni-Giacobino etal. 
(1 997) Genomics 44:309-320; and Jacquinet et al. (2000) FEBS Lett. 468; 93- 
100; SEQ ID Nos. 69 and 70 for the human protein) TMPRSS4 (see, Accession 
No. NM 016425; Wallrapp etal. (2000) Cancer 60:2602-2606; SEQ ID Nos. 71 

25 and 72 for the human protein); and TADG-12 (also designated MTSP6, see SEQ 
ID Nos. 1 1 and 12; see International PCT application No. WO 00/52044, which 
claims priority to U.S. application Serial No. 09/261,416). 

Also provided are muteins of the single chain protease domains and 
MTSPs, particularly muteins in which the Cys residue in the protease domain 

30 that is free {I.e., does not form disulfide linkages with any other Cys residue in 
the protein) is substituted with another amino acid substitution, preferably with a 
conservative amino acid substitution or a substitution that does not eliminate the 
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activity, and muteins in which a gtycosylation site(s) is eliminated. Muteins in 
which other conservative amino acid substitutions in which catalytic activity is 
retained are also contemplated (see, e.g., Table 1, for exemplary amino acid 
substitutions). See, also, Figure 4, which identifies the free Cys residues in 
5 MTSP3, MTSP4 and MTSP6. 

Hence, provided herein are members of a family of transmembrane serine 
protease (MTSP) proteins, and functional domains, especially protease (or 
catalytic) domains thereof, muteins and other derivatives and analogs thereof. 
Also provided herein are nucleic acids encoding the MTSPs. 

10 Exemplary MTSPs (see, e.g., SEQ ID No. 1-12, 49 and 50) are provided 

herein, as are the single chain protease domains thereof as follows: SEQ ID 
Nos. 1, 2, 49 and 50 set forth amino acid and nucleic acid sequences of MTSP1 
and the protease domain thereof; SEQ JD No. 3 sets forth the MTSP3 nucleic 
acid sequence and SEQ ID No. 4 the encoded MTSP3 amino acids; SEQ ID No. 5 

15 MTSP4 a nucleic acid sequence of the protease domain and SEQ ID No. 6 the 
encoded MTSP4 amino acid protease domain; SEQ ID No. 7 MTSP4-L a nucleic 
acid sequence and SEQ ID No. 8 the encoded MTSP4-L amino acid sequence; 
SEQ ID No. 9 an MTSP4-S. encoding nucleic acid sequence and SEQ ID No. 10 
the encoded MTSP4-S amino acid sequence; and SEQ ID No. 1 1 an MTSP6 

20 encoding nucleic acid sequence and SEQ ID No. 12 the encoded MTSP6 amino 
acid sequence. The single chain protease domains of each are delineated below. 

Nucleic acid molecules that encode a single-chain protease domain or 
catalytically active portion thereof are provided. Also provided are nucleic acid 
molecules that hybridize to such MTSP encoding nucleic acid along their full 

25 length and encode the protease domain or portion thereof are provided. 

Hybridization is preferably effected under conditions of at least low, generally at 
least moderate, and often high stringency. 

Additionally provided herein are antibodies that specifically bind to the 
MTSPs, cells, combinations, kits and articles of manufacture that contain the 

30 nucleic acid encoding the MTSP and/or the MTSP. Further provided herein are 
prognostic, diagnostic, therapeutic screening methods using MTSPs and the 
nucleic acids encoding MTSP. Also provided are transgenic non-human animals 
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bearing inactivated genes encoding the MTSP and bearing the genes encoding 
the MTSP under non-native promoter control. Such animals are useful in animal 
models of tumor initation, growth and/or progression models. 

Provided herein are members of a family of membrane serine proteases 
5 (MTSP) that are expressed in certain tumor or cancer cells such lung, prostate, 
colon and breast cancers. In particular, it is shown herein, that MTSPs, 
particularly, MTSP3, MTSP4 and MTSP6 are expressed in lung carcinoma, breast 
carcinoma, colon adenocarcinoma and/or ovarian carcinomas as well as in 
certain normal cells and tissues {see e.g., EXAMPLES for tissue-specific 

10 expression profiles of each protein exemplified herein). The MTSPs that are of 
particular interest herein, are those that are expressed in tumor cells, for 
example, those that appear to be expressed at different levels in tumor cells 
from normal cells, or whose functional activity is different in tumor cells from 
normal cells, such as by an alteration in a substrate therefor, or a cofactor. 

15 Hence the MTSP provided herein can serve as diagnostic markers for certain 

tumors. The level of activated MTSP3, MTSP4 and MTSP6 can be diagnostic of 
prostate cancer. In addition, MTSP4 is expressed and/or activated in 
lymphomas, leukemias, lung cancer, breast, prostrate and colon cancers. 
MTSP6 is activated and/or expressed in breast, lung, prostate, colon and ovarian 

20 cancers. Furthermore, compounds that modulate the activity of these MTSPs, 
as assessed by the assays provided herein, particularly the in vitro proteolytic 
assays that use the single chain protease domains, are potential therapeutic 
candidates for treatment of various malignancies and neoplastic disease. 

Also provided herein are methods of modulating the activity of the MTSPs 

25 and screening for compounds that modulate, including inhibit, antagonize, 

agonize or otherwise alter the activity of the MTSPs. Of particular interest is the 
extracellular domain of these MTSPs that includes the proteolytic (catalytic) 
portion of the protein. 

MTSP proteins, including, but not limited to, MTSP3, MTSP4, and 

30 MTSP6, including splice variants thereof, and nucleic acids encoding MTSPs, 
and domains, derivatives and analogs thereof are provided herein. Single chain 
protease domains, in the N-terminal is that which would be generated by 



RECTIFIED SHEET (RULE 91) 



WO 01/57194 



PCT/US01/03471 



activation of the zymogen, from any MTSP, particularly those that are not 
expressed in endothelial cells and that are expressed in tumor cells are also 
provided. 

Antibodies that specifically bind to the MTSP, particularly the single chain 
5 protease domain, and any and all forms of MTSP3 and MTSP4, and cells, 
combinations, kits and articles of manufacture containing the MTSP proteins, 
domains thereof, or encoding nucleic acids are also provided herein. Transgenic 
non-human animals bearing inactivated genes encoding the MTSP and bearing 
the genes encoding the MTSP under a non-native promotor control are 
10 additionally provided herein. Also provided are nucleic acid molecules encoding 
each of the MTSPs and domains thereof. 

Also provided are plasmids containing any of the nucleic acid molecules 
provided herein. Cells containing the plasmids are also provided. Such cells 
include, but are not limited to, bacterial cells, yeast cells, fungal cells, plant cells, 
15 insect cells and animal cells. 

Also provided is a method of producing a MTSP by growing the above- 
described cells under conditions whereby the MTSP is expressed by the cells, 
and recovering the expressed MTSP protein. Methods for isolating nucleic acid 
encoding other MTSPs are also provided. 
20 Also provided are cells, preferably eukaryotic cells, such as mammalian 

cells and yeast cells, in which the MTSP protein, preferably MTSP3 and MTSP4, 
is expressed in the surface of the cells. Such cells are used in drug screening 
assays to identify compounds that modulate the activity of the MTSP protein. 
These assays including in vitro binding assays, and transcription based assays in 
25 which signal transduction mediated by the MTSP is assessed. 

Further provided herein are prognostic, diagnostic and therapeutic 
screening methods using the MTSP and the nucleic acids encoding MTSP. In 
particular, the prognostic, diagnostic and therapeutic screening methods are 
used for preventing, treating, or for finding agents useful in preventing or 
30 treating, tumors or cancers such as lung carcinoma, colon adenocarcinoma and 
ovarian carcinoma. 
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Also provided are methods for screening for compounds that modulate 
the activity of any MTSP. The compounds are identified by contacting them 
with the MTSP and a substrate for the MTSP. A change in the amount of 
substrate cleaved in the presence of the compounds compared to that in the 
5 absence of the compound indicates that the compound modulates the activity of 
the MTSP. Such compounds are selected for further analyses or for use to 
modulate the activity of the MTSP, such as inhibitors or agonists. The 
compounds can also be identified by contacting the substrates with a cell that 
expresses the MTSP or the extracellular domain or proteolytrcally active portion 
.10 thereof. For assays in which the extracellular domain or a proteolytically active 
portion thereof is employed, the MTSP is any MTSP that is expressed on cells, 
other than endothelial cells, including, but not limited to MTSP1, MTSP3, MTSP4 
and MTSP6. 

Also provided herein are modulators of the activity of the MTSP, 
1 5 especially the modulators obtained according to the screening methods provide 
herein. Such modulators may have use in treating cancerous conditions, and 
other neoplastic conditions. 

Pharmaceutical composition containing the protease domains of an MTSP 
protein, and the MTSP proteins, MTSP3, MTSP4 and MTSP6 are provided herein 
20 in a pharmaceutically acceptable carrier or excipient are provided herein. 

Also provided are articles of manufacture that contain the MTSP proteins 
and protease domains of MTSPs in single chain form. The articles contain a) 
packaging material; b) the polypeptide (or encoding nucleic acid), particularly the 
single chain protease domain thereof; and c) a label indicating that the article is 
25 for using ins assays for identifying modulators of the activities of an MTSP 
protein is provided herein. 

Conjugates containing a) a MTSP protease domain in single chain from; 
and b) a targeting agent linked to the MTSP directly or via a linker, wherein the 
agent facilitates: i) affinity isolation or purification of the conjugate; ii) 
30 attachment of the conjugate to a surface; iii) detection of the conjugate; or iv) 
targeted delivery to a selected tissue or cell, is provided herein. The conjugate 
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can contain a plurality of agents linked thereto. The conjugate can be a 
chemical conjugate; and it can be a fusion protein. 

In yet another embodiment, the targeting agent is a protein or peptide 
fragment. The protein or peptide fragment can include a protein binding 
5 sequence, a nucleic acid binding sequence, a lipid binding sequence, a 
polysaccharide binding sequence, or a metal binding sequence. 

Method of diagnosing a disease or disorder characterized by detecting an 
aberrant level of an MTSP, particularly an MTSP3, MTSP4 or MTSP 6, in a 
subject is provided. The method can be practiced by measuring the level of the 
10 DNA, RNA, protein or functional activity of the MTSP. An increase or decrease 
in the level of the DNA, RNA, protein or functional activity of the MTSP, relative 
to the level of the DNA, RNA, protein or functional activity found in an 
analogous sample not having the disease or disorder (or other suitable control) is 
indicative of the presence of the disease or disorder in the subject or other 
15 relative any other suitable control. 

Combinations are provided herein. The combination can include: a) an 
inhibitor of the activity of an MTSP; and b) an anti-cancer treatment or agent. 
The MTSP inhibitor and the anti-cancer agent can be formulated in a single 
pharmaceutical composition or each is formulated in a separate pharmaceutical 
20 composition. The MTSP inhibitor can be an antibody or a fragment or binding 
portion thereof against the MTSP, such as an antibody that specifically binds to 
the protease domain, an inhibitor of the MTSP production, or an inhibitor of the 
MTSP membrane-localization or an inhibitor of MTSP activation. Other MTSP 
inhibitors include, but are not limited to, an antisense nucleic acid encoding the 
25 MTSP, particularly a portion of the protease domain; a nucleic acid encoding at 
least a portion of a gene encoding the MTSP with a heterologous nucleotide 
sequence inserted therein such that the heterologous sequence inactivates the 
biological activity encoded MTSP or the gene encoding it. The portion of the 
gene encoding the MTSP preferably flanks the heterologous sequence to 
promote homologous recombination with a genomic gene encoding the MTSP. 

Also, provided are methods for treating or preventing a tumor or cancer in 
a mammal by administering to a mammal an effective amount of an inhibitor of 
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an MTSP3, MTSP4 or MTSP6, whereby the tumor or cancer is treated or 
prevented. The MTSP inhibitor used in the treatment or for prophylaxis is 
administered with a pharmaceutically acceptable carrier or excipient. The 
mammal treated can be a human. The treatment or prevention method can 
5 additionally include administering an anti-cancer treatment or agent 

simultaneously with or subsequently or before administration of the MTSP 
inhibitor. 

Also provided is a recombinant non-human animal in which an 
endogenous gene of an MTSP has been deleted or inactivated by homologous 
10 recombination or insertional mutagenesis of the animal or an ancestor thereof. A 
recombinant non-human animal is provided herein, where the gene of an MTSP 
is under control of a promoter that is not the native promoter of the gene or that 
is not the native promoter of the gene in the non-human animal or where the 
nucleic acid encoding the MTSP is heterologous to the non-human animal and 
15 the promoter is the native or a non-native promoter. 

Also provided are methods of treatments of tumors by administering a 
prodrug that is activated by an MTSP that is expressed or active in tumor cells, 
particularly those in which its functional activity in tumor cells is greater than in 
none-tumor cells. The prodrug is administered and, upon administration, active 
20 MTSP expressed on cells cleaves the prodrug and releases active drug in the 

vicinity of these cells. The active anti-cancer drug accumulates in the vicinity of 
the tumor. This is particularly useful in instances in which an MTSP is expressed 
or active in greater quantity, higher level or predominantly in tumor cells 
compared to other cells. 
25 BRIEF DESCRIPTION OF DRAWINGS 

Figure 1 illustrates the domain organization of the MTSP3; 
Figure 2 illustrates the domain organization of the MTSP4 splice variants 
and domains thereof; MTSP4-L includes a transmembrane domain, a CUB 
domain, a low density lipoprotein receptor (LDLR) domains, and a serine protease 
30 catalytic domain; MTSP4-S lacking the portion between amino acids 1 36-279. 

Figure 3 depicts the domain organization of MTSP6. 
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Figure 4 provides an alignment of the C-terminal portions of MTSP3, the 
two splice variant-encoded forms of MTSP4, and MTSP6, that encompasses the 
protease domains thereof; the figure shows the cleavage sites, which form the 
N-terminus of the protease domain of each protein; a potential glycosylation site 
5 is noted and the free Cys residues in the protease domain of each. are noted (*). 
Muteins of each protein may be prepared by replacing the residues in the 
glycosylation site, particularly the N residue, and the free Cys residues, with 
preferably conservative amino acid residues. Such muteins are also provided 
herein. 

10 DETAILED DESCRIPTION OF THE INVENTION 
A. DEFINITIONS 

Unless defined otherwise, all technical and scientific terms used herein 
have the same meaning as is commonly understood by one of ordinary skill in 
the art to which this invention belongs. All patents, applications, published 

15 applications and other publications and sequences from GenBank and other data 
bases referred to herein are incorporated by reference in their entirety. 

As used herein, the abbreviations for any protective groups, amino acids 
and other compounds, are, unless indicated otherwise, in accord with their 
common usage, recognized abbreviations, or the IUPAC-IUB Commission on 

20 Biochemical Nomenclature (see, (1972) Biochem. 1 1 :942-944). 

As used herein, serine protease refers to a diverse family of proteases 
wherein a serine residue is involved in the hydrolysis of proteins or peptides. 
The serine residue can be part of the catalytic triad mechanism, which includes a 
serine, a histidine and an aspartic acicJ in the catalysis, or be part of the 

25 hydroxyl/e-amine or hydroxyl/a-amine catalytic dyad mechanism, which involves 
a serine and a lysine in the catalysis. 

As used herein, "transmembrane serine protease (MTSP) n refers to a 
family of transmembrane serine proteases that share common structural features 
as described herein (see, also Hooper eta/. (2001) J. Biol. Chem. 275:857-860). 

30 Thus, reference, for example, to °MTSP" encompasses all proteins encoded by 
the MTSP gene family, including but are not limited to: MTSP1, MTSP3, MTSP4 
and MTSP6, or an equivalent mol cule obtained from any other source or that 
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has been prepared synthetically or that exhibits the same activity. Other MTSPs 
include, but are not limited to, corin, enterpeptidase, human airway trypsin-like 
protease (HAT), MTSP1 , TMPRSS2, and TMPRSS4. Sequences of encoding 
nucleic molecules and the encoded amino acid sequences of exemplary MTSPs 
5 and/or domains thereof are set forth in SEQ ID Nos. 1-12, 49, 50 and 61-72. 

The term also encompass MTSPs with conservative amino acid substitutions that 
do not substantially alter activity of each member, and also encompasses splice 
variants thereof. Suitable conservative substitutions of amino acids are known 
to those of skill in this art and may be made generally without altering the 

10 biological activity of the resulting molecule. ,Of particular interest are MTSPs of 
mammalian, including human, origin. Those of skill in this art recognize that, in 
general, single amino acid substitutions in non-essential regions of a polypeptide 
do not substantially alter biological activity (see, e.g. , Watson et aL Molecular 
Biology of the Gene, 4th Edition, 1987, The Bejacmin/Cummings Pub. co., 

15 p.224). 

As used herein, a "protease domain of an MTSP" refers to the protease 
domain of MTSP that is located within the extracellular domain of a MTSP and 
exhibits serine proteolytic activity. It includes at least the smallest fragment 
thereof that acts catalytically as a single chain form. Hence it is at least the 

20 minimal portion of the extracellular domain that exhibits proteolytic activity as 
assessed by standard assays in vitro assays. Those of skill in this art recognize 
that such protease domain is the portion of the protease that is structurally 
equivalent to the trypsin or chymotrypsin fold. 

Exemplary MTSP proteins, with the protease domains indicated, are 

25 illustrated in Figures 1-3, Smaller portions thereof that retain protease activity 
are contemplated. The protease domains vary in size and constitution, including 
insertions and deletions in surface loops. They retain conserved structure, 
including at least one of the active site triad, primary specificity pocket, 
oxyanion hole and/or other features of serine protease domains of proteases. 

30 Thus, for purposes herein, the protease domain is a portion of a MTSP, as 

defined herein, and is homologous to a domain of other MTSPs, such as corin, 
enterpeptidase, human airway trypsin-like protease (HAT), MTSP1, TMPRSS2, 
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and TMPRSS4, which have been previously identified; it was not recogniz d, 
however, that an isolated single chain form of the protease domain could 
function proteolytically in in vitro assays. As with the larger class of enzymes of 
the chymotrypsin (S1) fold (see, e.g., Internet accessible MEROPS data base), 
5 the MTSPs protease domains share a high degree of amino acid sequence 
identity. The His, Asp and Ser residues necessary for activity are present in 
conserved motjfs. The activation site, which results in the N-terminus of second 
chain in the two chain forms is has a conserved motif and readily can be 
identified (see, e.g., amino acids 801-806, SEQ ID No. 62, amino acids 406- 
10 410, SEQ ID No. 64; amino acids 186-190, SEQ ID No. 66; amino acids 161- 
166, SEQ ID No. 68; amino acids 255-259, SEQ ID No. 70; amino acids 190- 
194, SEQ ID No. 72). 

As used herein, the catalytically active domain of an MTSP refers to the 
protease domain. Reference to the protease domain of an MTSP refers includes 

15 the single and double-chain forms of any of these proteins. The zymogen form 
of each protein is single chain form, which can be converted to the active two 
chain form by cleavage. The protease domain may also be converted to a two 
chain form. By active form is meant a form active in vivo. 

Significantly, it is shown herein, that, at least in vitro, the single chain 

20 forms of the MTSPs and the catalytic domains or proteolytically active portions 
thereof (typically C-terminal truncations) thereof exhibit protease activity. Hence 
provided herein are isolated single chain forms of the protease domains of 
MTSPs and their use in in vitro drug screening assays for identification of agents 
that modulate the activity thereof. 

25 As used herein an MTSP3, whenever referenced herein, includes at least 

one or all of or any combination of: 

a polypeptide encoded by the sequence of nucleotides set forth in 

SEQ ID No. 3; 

a polypeptide encoded by a sequence of nucleotides that 
30 hybridizes under conditions of low, moderate or high stringency to the sequence 
of nucleotides set forth in SEQ ID No. 3; 
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a polypeptide that comprises the sequence of amino acids set 
forth as amino acids 205-437 of SEQ ID No. 4; 

a polypeptide that comprises a sequence of amino acids having at 
least about 85% or 90% sequence identity with the sequence of amino acids set 
5 forth in SEQ ID No. 4; and/or 

a splice variant of the MTSP3 set forth in SEQ ID Nos. 3 and 4. 
The MTSP3 may be from any animal, particularly a mammal, and includes 
but are not limited to, humans, rodents, fowl, ruminants and other animals. The 
full length zymogen or double chain activated form is contemplated or any 
0 domain thereof, including the protease domain, which can be a double chain 
activated form, or a single chain form. 

As used herein an MTSP4, whenever referenced herein, includes at least 
one or all of or any combination of: 

a polypeptide encoded by the sequence of nucleotides set forth in 
5 any of SEQ ID No. 5, 7 or 9; 

a polypeptide encoded by a sequence of nucleotides that 
hybridizes under conditions of low, moderate or high stringency to the sequence 
of nucleotides set forth in any of SEQ ID Nos. 5, 7 or 9; 

a polypeptide that comprises the sequence of amino acids set 
forth in any of SEQ ID Nos. 6, 8 or 10; 

a polypeptide that comprises a sequence of amino acids having at 
least about 85% or 90% or 95% sequence identity with the sequence of amino 
acids set forth in SEQ ID No. 6, 8 or 10; and/or 

a splice variant of the MTSP4s set forth in SEQ ID Nos. 7-10. 
The MTSP4 may be from any animal, particularly a mammal, and includes 
but are not limited to, humans, rodents, fowl, ruminants and other animals. The 
full length zymogen or double chain activated form is contemplated or any 
domain thereof, including the protease domain, which can be a double chain 
activated form, or a single chain form. 

As used herein an MTSP6, whenever referenced herein, includes at least 
one or all of or any combination of: 
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a polypeptide encoded by the sequence of nucleotides set forth in 
any of SEQ ID No. 11; 

a polypeptide encoded by a sequence of nucleotides that 
hybridizes under conditions of low, moderate or high stringency to the sequence 
5 of nucleotides set forth in any of SEQ ID Nos. 1 1 ; 

a polypeptide that comprises the sequence of amino acids set 
forth in any of SEQ ID Nos. 1 2; 

a polypetide that comprises a sequence of amino acids having at 
least about 90% or 95% or 98% sequence identity with the sequence of amino 
10 acids set forth in SEQ ID No. 1 2; and/or 

a splice variant of the MTSP4s set forth in SEQ ID No. 1 2. 
The MTSP6 may be from any animal, particularly a mammal, and includes but 
are not limited to, humans, rodents, fowl, ruminants and other animals. The full 
length zymogen or double chain activated form is contemplated or any domain 
1 5 thereof, including the protease domain, which can be a double chain activated 
form, or a single chain form. Of particular interest herein is the MTSP6 of SEQ 
ID No. 12. 

As used herein, a human protein is one encoded by DNA present in the 
genome of a human, including all allelic variants and conservative variations as 
20 long as they are not variants found in other mammals. 

As used herein, a "nucleic acid encoding a protease domain or 
catalytically active portion of a MTSP" shall be construed as referring to a 
nucleic acid encoding only the recited single chain protease domain or active 
portion thereof, and not the other contiguous portions of the MTSP as a 
25 continuous sequence. 

As used herein, a CUB domain is a motif that mediates protein-protein 
interactions in complement components C1r/C1s and has also been identified in 
various proteins involved in developmental processes. 

As used herein, catalytic activity refers to the activity of the MTSP as a 
30 serine proteases. Function of the MTSP refers to its function in tumor biology, 
including promotion of or involvement in tumorigenesis, metastasis or 
carcinogenesis, and also roles in signal transduction. 
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As used herein, a "propeptide" or "pro sequence" is sequence of amino 
acids positioned at the amino terminus of a mature biologically active 
polypeptide. When so-positioned, the resulting polypeptide is called a zymogen. 
Zymogens, generally, are biologically inactive and can be converted to mature 
5 active polypeptides by catalytic or autocatalytic cleavage of the propeptide from 
the zymogen. A zymogen is an enzymatically inactive protein that is converted 
to a proteolytic enzyme by the action of an activator. Cleavage may be effected 
autocatalytically. 

As used herein, "disease or disorder" refers to a pathological condition in 
10 an organism resulting from, e.g., infection or genetic defect, and characterized 
by identifiable symptoms. 

As used herein, neoplasm (neoplasia) refers to abnormal new growth, and 
thus means the same as tumor, which may be benign or malignant. Unlike 
hyperplasia, neoplastic proliferation persists even in the absence of the original 
15 stimulus. 

As used herein, neoplastic disease refers to any disorder involving cancer, 
including tumor development, growth, metastasis and progression. 

As used herein, cancer refers to a general term for diseases caused by 
any type of malignant tumor. 
20 As used herein, malignant, as applies to tumors, refers to primary tumors 

that have the capacity of metastasis with loss of growth control and positional 
control* 

As used herein, an anti-cancer agent (used interchangeable with "anti- 
tumor or antineoplastic agent") refers to any agents used in the anti-cancer 

25 treatment. These include any agents, when used alone or in combination with 
other compounds, that can alleviate, reduce, ameliorate, prevent, or place or 
maintain in a state of remission of clinical symptoms or diagnostic markers 
associated with neoplastic disease, tumor and cancer, and can be used in 
methods, combinations and compositions provided herein. Non-limiting 

30 examples of antineoplastic agents include anti-angiogenic agents, alkylating 
agents, antimetabolite, certain natural products, platinum coordination 
complexes, anthracenediones, substituted ureas, methylhydrazine derivatives, 



RECTIFIED SHEET (RULE 91) 



WO 01/57194 



PCTAJS01/03471 



-21- 



adrenocortical suppressants, certain hormones, antagonists and anti-cancer 
polysaccharides. 

As used herein, a splice variant refers to a variant produced by differential 
processing of a primary transcript of genomic DNA that results in more than one 
5 type of mRNA. Splice variants of MTSPs are provided herein. 

As used herein, angiogenesis is intended to broadly encompass the 
totality of processes directly or indirectly involved in the establishment and 
maintenance of new vasculature (neovascularization), including, but not limited 
to, neovascularization associated with tumors. 
10 As used herein, anti-angiogenic treatment or agent refers to any 

therapeutic regimen and compound, when used alone or in combination with 
other treatment or compounds, that can alleviate, reduce, ameliorate, prevent, or 
place or maintain in a state of remission of clinical symptoms or diagnostic 
markers associated with undesired and/or uncontrolled angiogenesis. Thus, for 
15 purposes herein an anti-angiogenic agent refers to an agent that inhibits the 

establishment or maintenance of vasculature. Such agents include, but are not 
limited to, anti-tumor agents, and agents for treatments of other disorders 
associated with undesirable angiogenesis, such as diabetic retinopathies, 
restenosis, hyperproliferative disorders and others. 
20 As used herein, non-anti-angiogenic anti-tumor agents refer to anti-tumor 

agents that do not act primarily by inhibiting angiogenesis. 

As used herein, pro-angiogenic agents are agents that promote the 
establishment or maintenance of the vasculature. Such agents include agents 
for treating cardiovascular disorders, including heart attacks and strokes. 
25 As used herein, undesired and/or uncontrolled angiogenesis refers to 

pathological angiogenesis wherein the influence of angiogenesis stimulators 
outweighs the influence of angiogenesis inhibitors. As used herein, deficient 
angiogenesis refers to pathological angiogenesis associated with disorders where 
there is a defect in normal angiogenesis resulting in aberrant angiogenesis or an 
30 absence or substantial reduction in angiogenesis. 

As used herein, endotheliase refers to a mammalian protein, including 
humans, that has a transmembrane domain and is expressed on the surface of 
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endothelial cells and includes a protease domain, particularly an extracellular 
protease domain, and is preferably a serine protease. Thus, reference, for 
example, to endotheliase encompasses all proteins encoded by the endotheliase 
gene family, or an equivalent molecule obtained from any other source or that 
5 has been prepared synthetically or that exhibits the same activity. The 

endotheliase gene family are transmembrane proteases expressed in endothelial 
cells. Endotheliases are excluded from the MTSPs contemplated herein. 

As used herein, the protease domain of an endotheliase refers to the 
polypeptide portion of the endotheliase that is the extracellular portion that 

10 exhibits protease activity. The protease domain is a polypeptide that includes at 
least the minimum number of amino acids, generally more than 50 or 100, 
required for protease activity. Protease activity may be assessed empirically, 
such as by testing the polypeptide for its ability to act as a protease. Assays, 
such as the assays described in the EXAMPLES, employing a known substrate in 

15 place of the test compounds may be used. Furthermore, since proteases, 

particularly serine proteases, have characteristic structures and sequences or 
motifs, the protease domain may be readily identified by such structure and 
sequence or motif. 

As used herein, the protease domain of an MTSP protein refers to the 
20 protease domain of an MTSP that is located within or is the extracellular domain 
of an MTSP and exhibits serine proteolytic activity. Hence it is at least the 
minimal portion of the extracellular domain that exhibits proteolytic activity as 
assessed by standard assays in vitro. It refers, herein, to a single chain form 
heretofore thought to be inactive. 
25 Exemplary protease domains include at least a sufficient portion of sequences of 
amino acids set forth as amino acids 615-855 in SEQ ID No. 2 (encoded by 
nucleotides 1865-2587 in SEQ ID No. 1; see also SEQ ID Nos. 49 and 50) from 
MTSP1, amino acids 205-437 of SEQ ID NO. 4 from MTSP3, SEQ ID No. 6, 
which sets forth the protease domain of MTSP4, and amino acids 217-443 of 
30 SEQ ID No. 1 1 from MTSP6. Also contemplated are nucleic acid molecules that 
encode polypeptide that has proteolytic activity in an in vitro proteolysis assay 
and that have at least 80%, 85%, 90% or 95% sequence identity with the full 
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length of a protease domain of an MTSP protein, or that hybridize along their full 
length to a nucleic acids that encode a protease domain, particularly under 
conditions of moderate, generally high, stringency. 

For each of these protease domains, residues at the N-terminus can be 
5 critical for activity, since it has been shown that an Asp in the N-terminus of 
such proteases is essential for formation of the catalytically active conformation 
upon activation cleavage of the zymogen form of the protease. It is shown 
herein that the protease domain of the singles chain form of the protease is 
catalytically active. Hence the protease domain will require the N-terminal amino- 

10 acids; the c-terminus portion may be truncated. The amount that can be 
removed can be determined empirically by testing the protein for protease 
activity in an in vitro assays that assesses catalytic cleavage. 

Hence smaller portions of the protease domains, particularly the single 
chain domains, thereof that retain protease activity are contemplated. Such 

15 smaller versions will generally be C-terminal truncated versions of the protease 
domains. The protease domains vary in size and constitution, including 
insertions and deletions in surface loops. Such domains exhibit conserved 
structure, including at least one structural feature, such as the active site triad, 
primary specificity pocket, oxyanion hole and/or other features of serine protease 

20 domains of proteases. Thus, for purposes herein, the protease domain is a 
single chain portion of an MTSP, as defined herein, but is homologous in its 
structural features and retention of sequence of similarity or homology the 
protease domain of chymotrypsin or trypsin. Most significantly, the polypeptide 
will exhibit proteolytic activity as a single chain. 

25 As used herein, by homologous means about greater than 25% nucleic 

acid sequence identity, preferably 25% 40%, 60%, 80%, 90% or 95%. The 
terms "homology" and "identity" are often used interchangeably. In general, 
sequences are aligned so that the highest order match is obtained (see, e.g.: 
Computational Molecular Biology, Lesk, A.M., ed., Oxford University Press, New 

30 York, 1988; Biocomputing: Informatics and Genome Projects, Smith, D.W., ed., 
Academic Press, New York, 1993; Computer Analysis of Sequence Data, Part I, 
Griffin, A.M., and Griffin, H.G., eds., Humana Press, New Jersey, 1994; 
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Sequence Analysis in Molecular Biology, von Heinje, G., Academic Press, 1987; 
and Sequence Analysis Primer, Gribskov, M. and Devereux, J., eds., M Stockton 
Press, New York, 1991; Carillo et al. (1988) SIAM J Applied Math 48:1073). 
By sequence identity, the number of conserved amino acids are 
5 determined by standard alignment algorithms programs, and are used with 
default gap penalties established by each supplier. Substantially homologous 
nucleic acid molecules would hybridize typically at moderate stringency or at 
high stringency all along the length of the nucleic acid of interest. Also 
contemplated are nucleic acid molecules that contain degenerate codons in place 

10 of codons in the hybridizing nucleic acid molecule. 

Whether any two nucleic acid molecules have nucleotide sequences that 
are at least 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% "identical" can be 
determined using known computer algorithms such as the "FAST A" program, 
using for example, the default parameters as in Pearson et aL (1988) Proc. Natl. 

15 Acad. Sci. USA 85:2444 (other programs include the GCG program package 
(Devereux, J., et aL, Nucleic Acids Research 72fl):3B7 (1984)), BLASTP, 
BLASTN, FASTA (Atschul, S.F., era/., J Molec Biol 275:403 (1990); Guide to 
Huge Computers, Martin J. Bishop, ed., Academic Press, San Diego, 1994, and 
Carillo et aL (1 988) SIAM J Applied Math 45:1 073). For example, the BLAST 

20 function of the National Center for Biotechnology Information database may be 
used to determine identity. Other commercially or publicly available programs 
include, DNAStar "MegAlign" program (Madison, Wl) and the University of 
Wisconsin Genetics Computer Group (UWG) "Gap" program (Madison Wl)). 
Percent homology or identity of proteins and/or nucleic acid moleucles may be 

25 determined, for example, by comparing sequence information using a GAP 
computer program (e.g., Needleman et aL (1970)*/. MoL Biol. 48:443, as 
revised by Smith and Waterman ((1981) Adv. Appl. Math. 2:482). Briefly, the 
GAP program defines similarity as the number of aligned symbols (i.e., 
nucleotides or amino acids) which are similar, divided by the total number of 

30 symbols in the shorter of the two sequences. Default parameters for the GAP 
program may include: (1) a unary comparison matrix (containing a value of 1 for 
identities and 0 for non-identities) and the weighted comparison matrix of 
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Gribskov et al. (1986) NucL Acids Res. 14:6745, as described by Schwartz and 
Dayhoff, eds., ATLAS OF PROTEIN SEQUENCE AND STRUCTURE, National 
Biomedical Research Foundation, pp. 353-358 (1979); (2) a penalty of 3.0 for 
each gap and an additional 0.10 penalty for each symbol in each gap; and (3) no 
5 penalty for end gaps. 

Therefore, as used herein, the term "identity" represents a comparison 
between a test and a reference polypeptide or polynucleotide. For example, a 
test polypeptide may be defined as any polypeptide that is 90% or more 
identical to a reference polypeptide. As used herein, the term at least "90% 

10 identical to" refers to percent identities from 90 to 99.99 relative to the 

reference polypeptides. Identity at a level of 90% or more is indicative of the 
fact that, assuming for exemplification purposes' a test and reference 
polynucleotide length of 100 amino acids are compared. No more than 10% 
(i.e., 10 out of 100) amino acids in the test polypeptide differs from that of the 

15 reference polypeptides. Similar comparisons may be made between a test and 
reference polynucleotides. Such differences may be represented as point 
mutations randomly distributed over the entire length of an amino acid sequence 
or they may be clustered in one or more locations of varying length up to the 
maximum allowable, e.g. 10/100 amino acid difference (approximately 90% 

20 identity). Differences are defined as nucleic acid or amino acid substitutions, or 
deletions. At level of homologies or identities above about 85-90%, the result 
should be independent of the program and gap parameters set; such high levels 
of identity readily can be assess, often without relying on software. 

As used herein, primer refers to an oligonucleotide containing two or 

25 more deoxyribonucleotides or ribonucleotides, preferably more than three, from 
which synthesis of a primer extension product can be initiated. Experimental 
conditions conducive to synthesis include the presence of nucleoside 
triphosphates and an agent for polymerization and extension, such as DNA 
polymerase, and a suitable buffer, temperature and pH. 

30 As used herein, animals include any animal, such as, but are not limited 

to, goats, cows, deer, sheep, rodents, pigs and humans. Non-human animals, 
exclude humans as the contemplated animal. The MTSPs provided herein are 
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from any source, animal, plant, prokaryotic and fungal. Preferred MTSPs are of 
animal origin, preferably mammalian origin. 

As used herein, genetic therapy involves the transfer of heterologous 
DNA to the certain cells, target cells, of a mammal, particularly a human, with a 
5 disorder or conditions for which such therapy is sought. The DNA is introduced 
into the selected target cells in a manner such that the heterologous DNA is 
expressed and a therapeutic product encoded thereby is produced. 
Alternatively, the heterologous DNA may in some manner mediate expression of 
DNA that encodes the therapeutic product, or it may encode a product, such as 

10 a peptide or RNA that in some manner mediates, directly or indirectly, expression 
of a therapeutic product. Genetic therapy may also be used to deliver nucleic 
acid encoding a gene product that replaces a defective gene or supplements a 
gene product produced by the mammal or the cell in which it is introduced. The 
introduced nucleic acid may encode a therapeutic compound, such as a growth 

15 factor inhibitor thereof, or a tumor necrosis factor or inhibitor thereof, such as a 
receptor therefor, that is not normally produced in the mammalian host or that is 
not produced in therapeutically effective amounts or at a therapeutically useful 
time. The heterologous DNA encoding the therapeutic product may be modified 
prior to introduction into the cells of the afflicted host in order to enhance or 

20 otherwise alter the product or expression thereof. Genetic therapy may also 
involve delivery of an inhibitor or repressor or other modulator of gene 
expression. 

As used herein, heterologous DNA is DNA that encodes RNA and proteins 
that are not normally produced in vivo by the cell in which it is expressed or that 

25 mediates or encodes mediators that alter expression of endogenous DNA by 
affecting transcription, translation, or other regulatable biochemical processes. 
Heterologous DNA may also be referred to as foreign DNA. Any DNA that one 
of skill in the art would recognize or consider as heterologous or foreign to the 
cell in which is expressed is herein encompassed by heterologous DNA. 

30 Examples of heterologous DNA include, but are not limited to, DNA that encodes 
traceable marker proteins, such as a protein that confers drug resistance, DNA 
that encodes therapeutically effective substances, such as anti-cancer agents, 
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enzymes and hormones, and DNA that encodes other types of proteins, such as 
antibodies. Antibodies that are encoded by heterologous DNA may be secreted 
or expressed on the surface of the cell in which the heterologous DNA has been 
introduced. 

5 Hence, herein heterologous DNA or foreign DNA, includes a DNA 

molecule not present in the exact orientation and position as the counterpart 
DNA molecule found in the genome. It may also refer to a DNA molecule from 
another organism or species {i.e., exogenous). 

As used herein, a therapeutically effective product is a product that is 
10 encoded by heterologous nucleic acid, typically DNA, that, upon introduction of 
the nucleic acid into a host, a product is expressed that ameliorates or eliminates 
the symptoms, manifestations of an inherited or acquired disease or that cures 
the disease. 

As used herein, recitation that a polypeptide consists essentially of the 
15 protease domain means that the only MTSP portion of the polypeptide is a 

protease domain or a catalytically active portion thereof. The polypeptide may 
optionally, and generally will, include additional non-MTSP-derived sequences of 
amino acids. 

As used herein, cancer or tumor treatment or agent refers to any 
20 therapeutic regimen and/or compound that, when used alone or in combination 
with other treatments or compounds, can alleviate, reduce, ameliorate, prevent, 
or place or maintain in a state of remission of clinical symptoms or diagnostic 
markers associated with deficient angiogenesis. 

As used herein, domain refers to a portion of a molecule, e.g., proteins 
25 or nucleic acids, that is structurally and/or functionally distinct from other 
portions of the molecule. 

As used herein, protease refers to an enzyme catalyzing hydrolysis of 
proteins or peptides. For purposes herein, the protease domain is a single chain 
form of an MTSP protein. For MTSP3 and MTSP4 the protease domain also 
30 includes two chain forms. 
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As used herein, catalytic activity refers to the activity of the MTSP as a 
protease as assessed in in vitro proteolytic assays that detect proteolysis of a 
selected substrate. 

As used herein, nucleic acids include DNA, RNA and analogs thereof, 
5 including protein nucleic acids (PNA) and mixture thereof. Nucleic acids can be 
single or double stranded. When referring to probes or primers, optionally 
labeled, with a detectable label, such as a fluorescent or radiolabel, single- 
stranded molecules are contemplated. Such molecules are typically of a length 
such that they are statistically unique and of a low copy number (typically less 

10 than 5, preferably less than 3) for probing or priming a library. Generally a probe 
or primer contains at least 14, 16 or 30 contiguous sequence complementary to 
or identical to a gene of interest. Probes and primers can be 10, 20, 30, 50, 
100 or more nucleic acids long. 

As used herein, nucleic acid encoding a fragment or portion of an MTSP 

15 refers to a nucleic acid encoding only the recited fragment or portion of MTSP, 
and not the other contiguous portions of the MTSP. 

As used herein, heterologous or foreign DNA and RNA are used 
interchangeably and refer to DNA or RNA that does not occur naturally as part of 
the genome in which it Is present or which is found in a location or locations in 

20 the genome that differ from that in which it occurs in nature. Heterologous 

nucleic acid is generally not endogenous to the cell into which it is introduced, 
but has been obtained from another cell or prepared synthetically. Generally, 
although not necessarily, such nucleic acid encodes RNA and proteins that are 
not normally produced by the cell in which it is expressed. Any DNA or RNA 

25 that one of skill in the art would recognize or consider as heterologous or foreign 
to the cell in which it is expressed is herein encompassed by heterologous DNA. 
Heterologous DNA and RNA may also encode RNA or proteins that mediate or 
alter expression of endogenous DNA by affecting transcription, translation, or 
other regulatable biochemical processes. 

30 As used herein, operative linkage of heterologous DNA to regulatory and 

effector sequences of nucleotides, such as promoters, enhancers, transcriptional 
and translational stop sites, and other signal sequences refers to the relationship 
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between such DNA and such sequences of nucleotides. For example, operative 
linkage of heterologous DNA to a promoter refers to the physical relationship 
between the DNA and the promoter such that the transcription of such DNA is 
initiated from the promoter by an RNA polymerase that specifically recognizes, 
5 binds to and transcribes the DNA in reading frame. 

As used herein, a sequence complementary to at least a portion of an 
RNA, with reference to antisense oligonucleotides, means a sequence having 
sufficient complementarily to be able to hybridize with the RNA, preferably under 
moderate or high stringency conditions, forming a stable duplex; in the case of 

10 double-stranded MTSP antisense nucleic acids, a single strand of the duplex 
DNA may thus be tested, or triplex formation may be assayed. The ability to 
hybridize depends on the degree of complementarily and the length of the 
antisense nucleic acid. Generally, the longer the hybridizing nucleic acid, the 
more base mismatches with a MTSP encoding RNA it can contain and still form 

15 a stable duplex (or triplex, as the case may be). One skilled in the art can 
ascertain a tolerable degree of mismatch by use of standard procedures to 
determine the melting point of the hybridized complex. 

For purposes herein, conservative amino acid substitutions may be made 
in any of MTSPs and protease domains thereof provided that the resulting 

20 protein exhibits protease activity. Conservative amino acid substitutions, such 
as those set forth in Table 1 , are those that do not eliminate proteolytic activity. 
Suitable conservative substitutions of amino acids are known to those of skill in 
this art and may be made generally without altering the biological activity of the 
resulting molecule. Those of skill in this art recognize that, in general, single 

25 amino acid substitutions in non-essential regions of a polypeptide do not 

substantially alter biological activity {see, e.g. . Watson et aL Molecuiar Biology 
of the Gene, 4th Edition, 1987, The Bejacmin/Cummings Pub. co., p. 224). Also 
included within the definition, is the catalytically active fragment of an MTSP, 
particularly a single chain protease portion. Conservative amino acid 

30 substitutions are made, for example, in accordance with those set forth in 
TABLE 1 as follows: 
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TABLE 1 

0ri9 i?fJ , r «f ue Conservative substitution 

A,a (A ' GJy; Ser, Abu 

Ar 9 ,R) Lys,orn 

Asn < N > Gin; His 

Cys (C) Sef 

G,n < Q > Asn 



Glu (E) 



Asp 



10 His (H) 



G 'V < G > Ala; Pro 



Asn; Gin 



^ 5 Ornitine 



l,e (,) Leu; Val; Met; Nle; Nva 
Leu (L) "e; Val; Met; Nle; Nv 

Lys (K) Arg; Gin; Glu 

Met (M) Leu; Tyr; lie; NLe Val 



20 Tyr (Y) 



Lys; Arg 

Pne (F) Met; Leu; Tyr 

Ser (S) Tnr 

Thr (T) Ser 

Trp (W) j 



Trp; Phe 



25 



30 



35 



n+u u • • Val (V> ,,e ' Leu; Met; Nle; Nv 

Other substitutions are also permissible and may be determined empirically or in 

accord with known conservative substitutions. 

As used herein, Abu is 2-aminobutyric acid; Orn is ornithine. 

As used herein, the amino acids, which occur in the various amino acid 
sequences appearing herein, are identified according to their well-known, three- 
letter or one-letter abbreviations. The nucleotides, which occur in the various 
DMA fragments, are designated with the standard single-letter designations used 
routinely in the art. 

As used herein, a splice variant refers to a variant produced by differential 
processing of a primary transcript of genomic DNA that results in more than one 
type of mRNA. 

As used herein, a probe or primer based on a nucleotide sequence 
disclosed herein, includes at least 10, 14, preferably at least 16 or 30 or 100 
contiguous sequence of nucleotides of SEQ ID Nos. 1 , 3, 5, 7, 9 or 1 1 . 

As used herein, amelioration of the symptoms of a particular disorder by 
adm-nistration of a particular pharmaceutical composition refers to any .essening. 
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whether permanent or temporary, lasting or transient that can be attributed to or 
associated with administration of the composition. 

As used herein, antisense polynucleotides refer to synthetic sequences of 
nucleotide bases complementary to mRNA or the sense strand of double 
5 stranded DNA. Admixture of sense and antisense polynucleotides under 
appropriate conditions leads to the binding of the two molecules, or 
hybridization. When these polynucleotides bind to (hybridize with) mRNA, 
inhibition of protein synthesis (translation) occurs. When these polynucleotides 
bind to double stranded DNA, inhibition of RNA synthesis (transcription) occurs. 

10 The resulting inhibition of translation and/or transcription leads to an inhibition of 
the synthesis of the protein encoded by the sense strand. Antisense nucleic 
acid molecule typically contain a sufficient number of nucleotides to specifically 
bind to a target nucleic acid, generally at least 5 contiguous nucleotides, often at 
least 14 or 16 or 30 contiguous nucleotides or modified nucleotides 

15 complementary to the coding portion of a nucleic acid molecule that encodes a 
gene of interest, for example, nucleic acid encoding a single chain protease 
domain of an MTSP. 

As used herein, an array refers to a collection of elements, such as 
antibodies, containing three or more members. An addressable array is one in 

20 which the members of the array are identifiable, typically by position on a solid 
phase support. Hence, in general the members of the array will be immobilized 
to discrete identifiable loci on the surface of a solid phase. 

As used herein, antibody refers to an immunoglobulin, whether natural or 
partially or wholly synthetically produced, including any derivative thereof that 

25 retains the specific binding ability the antibody. Hence antibody includes any 
protein having a binding domain that is homologous or substantially homologous 
to an immunoglobulin binding domain. Antibodies include members of any 
immunoglobulin claims, including IgG, IgM, IgA, IgD and IgE. 

As used herein, antibody fragment refers to any derivative of an antibody 

30 that is less then full length, retaining at least a portion of the full-length 

antibody's specific binding ability. Examples of antibody fragments include, but 
are not limited to, Fab, Fab', F(ab) 2/ single-chain Fvs (scFV), FV, dsFV diabody 
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and Fd fragments. The fragment can include multiple chains linked together, 
such as by disulfide bridges. An antibody fragment generally contains at least 
about 50 amino acids and typically at least 200 amino acids. 

As used herein, an Fv antibody fragment is composed of one variable 
5 heavy domain (V H ) and one variable light domain linked by noncovalent 
interactions. 

As used herein, a dsFV refers to an Fv with an engineered intermolecular 
disulfide bond, which stabilizes the V M -V L pair. 

As used herein, an F(ab) 2 fragment is an antibody fragment that results 
10 from digestion of an immunoglobulin with pepsin at pH 4.0-4.5; it may be 
recombinantly produced. 

As used herein, Fab fragments is an antibody fragment that results from 
digestion of an immunoglobulin with papain; it may be recombinantly produced. 

As used herein, scFVs refer to antibody fragments that contain a variable 
15 light chain {V L ) and variable heavy chain (V H ) covalently connected by a 
polypeptide linker in any order. The linker is of a length such that the two 
variable domains are bridged without substantial interference. Preferred linkers 
are {Gly-Ser) n residues with some Glu or Lys residues dispersed throughout to 
increase solubility. 

20 As used herein, humanized antibodies refer to antibodies that are 

modified to include human sequences of amino acids so that administration to a 
human will not provoke an immune response. Methods for preparation of such 
antibodies are known. For example, the hybridoma that expresses the 
monoclonal antibody is altered by recombinant DNA techniques to express an 

25 antibody in which the amino acid composition of the non-variable regions is 

based on human antibodies. Computer programs have been designed to identify 
such regions. 

As used herein, diabodies are dimeric scFV; diabodies typically have 
shorter peptide linkers than scFvs, and they preferentially dimerize. 
30 As used herein, humanized antibodies refer to antibodies that are 

modified to include human sequences of amino acids so that administration to a 
human will not provoke an immune response. Methods for preparation of such 
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antibodies are known. For example, the hybridoma that expresses the 
monoclonal antibody is altered by recombinant DNA techniques to express an 
antibody in which the amino acid composition of the non-variable regions is 
based on human antibodies. Computer programs have been designed to identify 
5 such regions. 

As used herein, production by recombinant means by using recombinant 
DNA methods means the use of the well known methods of molecular biology 
for expressing proteins encoded by cloned DNA. 

As used herein the term assessing is intended to include 

10 quantitative and qualitative determination in the sense of obtaining an 

absolute value for the activity of an MTSP, or a domain thereof, present in the 
sample, and also of obtaining an index, ratio, percentage, visual or other value 
indicative of the level of the activity. Assessment may be direct or indirect and 
the chemical species actually detected need not of course be the proteolysis 

15 product itself but may for example be a derivative thereof or some further 
substance. 

As used herein, biological activity refers to the ]n vivo activities of a 
compound or physiological responses that result upon in vivo administration of a 
compound, composition or other mixture. Biological activity, thus, encompasses 

20 therapeutic effects and pharmaceutical activity of such compounds, 

compositions and mixtures. Biological activities may be observed in in vitro 
systems designed to test or use such activities. Thus, for purposes herein the 
biological activity of a lucrferase is its oxygenase activity whereby, upon 
oxidation of a substrate, light is produced. 

25 As used herein, a combination refers to any association between two or 

among more items. 

As used herein, a composition refers to any mixture. It may be a 
solution, a suspension, liquid, powder, a paste, aqueous, non-aqueous or any 
combination thereof. 
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As used herein, a conjugate refers to the compounds provided herein that 
include one or more MTSPs, particularly single chain protease domains thereof, 
and one or more targeting agents. These conjugates include those produced by 
recombinant means as fusion proteins, those produced by chemical means, such 
5 as by chemical coupling, through, for example, coupling to sulfhydryl groups, 
and those produced by any other method whereby at least one MTSP, or a 
domain thereof, is linked, directly or indirectly via linker(s) to a targeting agent. 

As used herein, a targeting agent, is any moiety, such as a protein or 
effective portion thereof, that provides specific binding of the Conjugate to a cell 
10 surface receptor, which, preferably, internalizes the conjugate or MTSP portion 
thereof. A targeting agent may also be one that promotes or facilitates, for 
example, affinity isolation or purification of the conjugate; attachment of the 
conjugate to a surface; or detection of the conjugate or complexes containing 
the conjugate. 

15 As used herein, an antibody conjugate refers to a conjugate in which the 

targeting agent is an antibody. 

As used herein, humanized antibodies refer to antibodies that are 
modified to include human sequences of amino acids so that administration to a 
human will not provoke an immune response. Methods for preparation of such 

20 antibodies are known. For example, the hybridoma that expresses the 

monoclonal antibody is altered by recombinant DNA techniques to express an 
antibody in which the amino acid composition of the non-variable regions is 
based on human antibodies. Computer programs have been designed to identify 
such regions. 

25 As used herein, derivative or analog of a molecule refers to a portion 

derived from or a modified version of the molecule. 

As used herein, fluid refers to any composition that can flow. Fluids thus 
encompass compositions that are in the form of semi-solids, pastes, solutions, 
aqueous mixtures, gels, lotions, creams and other such compositions. 

30 As used herein, an effective amount of a compound for treating a 

particular disease is an amount that is sufficient to ameliorate, or in some 
manner reduce the symptoms associated with the disease. Such amount may 
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be administered as a single dosage or may be administered according to a 
regimen, whereby it is effective. The amount may cure the disease but, 
typically, is administered in order to ameliorate the symptoms of the disease. 
Repeated administration may be required to achieve the desired amelioration of 
5 symptoms. 

As used herein equivalent, when referring to two sequences of nucleic 
acids means that the two sequences in question encode the same sequence of 
amino acids or equivalent proteins. When equivalent is used in referring to two 
proteins or peptides, it means that the two proteins or peptides have 
10 substantially the same amino acid sequence with only conservative amino acid 
substitutions (see, e.g., Table 1, above) that do not substantially alter the 
activity or function of the protein or peptide. When equivalent refers to a 
property, the property does not need to be present to the same extent [ e.g. , two 
peptides can exhibit different rates of the same type of enzymatic activity], but 
15 the activities are preferably substantially the same. Complementary, when 
referring to two nucleotide sequences, means that the two sequences of 
nucleotides are capable of hybridizing, preferably with less than 25%, more 
preferably with less than 15%, even more preferably with less than 5%, most 
preferably with no mismatches between opposed nucleotides. Preferably the 
20 two molecules will hybridize under conditions of high stringency. 

As used herein, an agent that modulates the activity of a protein or 
expression of a gene or nucleic acid either decreases or increases or otherwise 
alters the activity of the protein or, in some manner up- or down-regulates or 
otherwise alters expression of the nucleic acid in a cell. 

As used herein, inhibitor of an the activity of an MTSP encompasses any 
substances that prohibit or decrease production, post-translational 
modification(s), maturation, or membrane localization of the MTSP or any 
substances that interfere with or decrease the proteolytic efficacy of thereof, 
particular of a single chain form in vitro. 

As used herein, a method for treating or preventing neoplastic disease 
means that any of the symptoms, such as the tumor, metastasis thereof, the 
vascularization of the tumors or other parameters by which the disease is 



25 



30 
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characterized are reduced, ameliorated, prevented, placed in a state of remission, 
or maintained in a state of remission. It also means that the hallmarks of 
neoplastic disease and metastasis may be eliminated, reduced or prevented by 
the treatment. Non-limiting examples of the hallmarks include uncontrolled 
5 degradation of the basement membrane and proximal extracellular matrix, 

migration, division, and organization of the endothelial cells into new functioning 
capillaries, and the persistence of such functioning capillaries. 

As used herein, operatively linked or operationally associated refers to the 
functional relationship of DNA with regulatory and effector sequences of 

10 nucleotides, such as promoters, enhancers, transcriptional and translational stop 
sites, and other signal sequences. For example, operative linkage of DNA to a 
promoter refers to the physical and functional relationship between the DNA and 
the promoter such that the transcription of such DNA is initiated from the 
promoter by an RNA polymerase that specifically recognizes, binds to and 

15 transcribes the DNA. In order to optimize expression and/or in vitro 

transcription, it may be necessary to remove, add or alter 5' untranslated 
portions of the clones to eliminate extra, potential inappropriate alternative 
translation initiation {i.e., start) codons or other sequences that may interfere 
with or reduce expression, either at the level of transcription or translation. 

20 Alternatively, consensus ribosome binding sites (see, e.g. , Kozak J. Biol. Chem. 
256:19867-19870 (1991)) can be inserted immediately 5' of the start codon 
and may enhance expression. The desirability of (or need for) such modification 
may be empiricaliy determined. 

As used herein, pharmaceutically acceptable salts, esters or other 

25 derivatives of the conjugates include any salts, esters or derivatives that may be 
readily prepared by those of skill in this art using known methods for such 
derivatization and that produce compounds that may be administered to animals 
or humans without substantial toxic effects and that either are pharmaceutically 
active or are prodrugs. 

30 As used herein, a prodrug is a compound that, upon in vivo 

administration, is metabolized or otherwise converted to the biologically, 
pharmaceutically or therapeutically active form of the compound. To produce a 
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prodrug, the pharmaceutically active compound is modified such that the active 
compound will be regenerated by metabolic processes. The prodrug may be 
designed to alter the metabolic stability or the transport characteristics of a drug, 
to mask side effects or toxicity, to improve the flavor of a drug or to alter other 
5 characteristics or properties of a drug. By virtue of knowledge of 

pharmacodynamic processes and drug metabolism jn vivo , those of skill in this 
art, once a pharmaceutically active compound is known, can design prodrugs of 
the compound (see, e.g. , Nogrady (1985) Medicinal Chemistry A Biochemical 
Approach , Oxford University Press, New York, pages 388-392). 

10 As used herein, a drug identified by the screening methods provided 

herein refers to any compound that is a candidate for use as a therapeutic or as 
lead compound for designed a therapeutic. Such compounds can be small 
molecules, including small organic molecules, peptides, peptide mimetics, 
antisense molecules, antibodies, fragments of antibodies, recombinant antibodies 

15 and other such compound which can serve as drug candidate or lead compound. 

As used herein, production by recombinant means by using recombinant 
DIMA methods means the use of the well known methods of molecular biology 
for expressing proteins encoded by cloned DNA. 

As used herein, a promoter region or promoter element refers to a 

20 segment of DNA or RNA that controls transcription of the DNA or RNA to which 
it is operatively linked. The promoter region includes specific sequences that are 
sufficient for RNA polymerase recognition, binding and transcription initiation. 
This portion of the promoter region is referred to as the promoter. In addition, 
the promoter region includes sequences that modulate this recognition, binding 

25 and transcription initiation activity of RNA polymerase. These sequences may be 
c/s acting or may be responsive to trans acting factors. Promoters, depending 
upon the nature of the regulation, may be constitutive or regulated. Exemplary 
promoters contemplated for use in prokaryotes include the bacteriophage T7 and 
T3 promoters. 

30 As used herein, a receptor refers to a molecule that has an affinity for a 

given ligand. Receptors may be naturally-occurring or synthetic molecules. 
Receptors may also be referred to in the art as anti-ligands. As used herein, the 
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receptor and anti-ligand are interchangeable. Receptors can be used in their 
unaltered state or as aggregates with other species. Receptors may be attached, 
covalently or noncovalently, or in physical contact with, to a binding member, 
either directly or indirectly via a specific binding substance or linker. Examples 
5 of receptors, include, but are not limited to: antibodies, cell membrane receptors 
surface receptors and internalizing receptors, monoclonal antibodies and antisera 
reactive with specific antigenic determinants [such as on viruses, cells, or other 
materials], drugs, polynucleotides, nucleic acids, peptides, cofactors, lectins, 
sugars, polysaccharides, cells, cellular membranes, and organelles. 
10 Examples of receptors and applications using such receptors, include but 

are not restricted to: 

a) enzymes: specific transport proteins or enzymes essential to survival 
of microorganisms, which could serve as targets for antibiotic [iigand] selection; 

b) antibodies: identification of a ligand-binding site on the antibody 
15 molecule that combines with the epitope of an antigen of interest may be 

investigated; determination of a sequence that mimics an antigenic epitope may 
lead to the development of vaccines of which the immunogen is based on one or 
more of such sequences or lead to the development of related diagnostic agents 
or compounds useful in therapeutic treatments such as for auto-immune diseases 
20 c) nucleic acids: identification of Iigand, such as protein or RNA, binding 

sites; 

d) catalytic polypeptides: polymers, preferably polypeptides, that are 
capable of promoting a chemical reaction involving the conversion of one or 
more reactants to one or more products; such polypeptides generally include a 

25 binding site specific for at least one reactant or reaction intermediate and an 
active functionality proximate to the binding site, in which the functionality is 
capable of chemically modifying the bound reactant [see, e.g. , U.S. Patent No. 
5,215,899]; 

e) hormone receptors: determination of the ligands that bind with high 
30 affinity to a receptor is useful in the development of hormone replacement 

therapies; for example, identification of ligands that bind to such receptors may 
lead to the development of drugs to control blood pressure; and 
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f) opiate receptors: determination of ligands that bind to the opiate 
receptors in the brain is useful in the development of less-addictive replacements 
for morphine and related drugs. 

As used herein, sample refers to anything which may contain an analyte 
5 for which an analyte assay is desired. The sample may be a biological sample, 
such as a biological fluid or a biological tissue. Examples of biological fluids 
include urine, blood, plasma, serum, saliva, semen, stool, sputum, cerebral spinal 
fluid, tears, mucus, amniotic fluid or the like. Biological tissues are aggregate of 
cells, usually of a particular kind together with their intercellular substance that 
10 form one of the structural materials of a human, animal, plant, bacterial, fungal 
or viral structure, including connective, epithelium, muscle and nerve tissue. 
Examples of biological tissues also include organs, tumors, lymph nodes, arteries 
and individual cell(s). 

As used herein: stringency of hybridization in determining percentage 
15 mismatch is as follows: 

1) high stringency: 0.1 x SSPE, 0.1% SDS, 65 °C 

2) medium stringency: 0.2 x SSPE, 0.1% SDS, 50°C 

3) low stringency: 1 .0 x SSPE, 0.1 % SDS, 50 °C 

Those of skill in this art know that the washing step selects for stable 
20 hybrids and also know the ingredients of SSPE {see, e.g., Sambrook, E.F. 

Fritsch, T. Maniatis, in: Molecular Cloning, A Laboratory Manual . Cold Spring 
Harbor Laboratory Press (1989), vol. 3, p. B.13, see, also, numerous catalogs 
that describe commonly used laboratory solutions). SSPE is pH 7.4 phophate- 
buffered 0.18 NaCI. Further, those of skill in the art recognize that the stability 
25 of hybrids is determined by T m , which is a function of the sodium ion 

concentration and temperature (T m = 81.5° C-16.6(log lo [Na + ]) + 0.41 (%G + Q- 
600/i)), so that the only parameters in the wash conditions critical to hybrid 
stability are sodium ion concentration in the SSPE (or SSC) and temperature. 
It is understood that equivalent stringencies may be achieved using 
30 alternative buffers, salts and temperatures. By way of example and not 

limitation, procedures using conditions of low stringency are as follows (see also 
Shilo and Weinberg, Proc. Nat/. Acad, ScL USA, 78:6789-6792 (1981)): Filters 
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containing DNA are pretreated for 6 hours at 40°C in a solution containing 35% 
formamide, 5X SSC, 50 mM Tris-HCI {pH 7.5), 5 mM EDTA, 0.1 % PVP, 0.1 % 
Ficoll, 1% BSA, and 500 /yg/ml denatured salmon sperm DNA (10X SSC is 1.5 
M sodium chloride, and 0.15 M sodium citrate, adjusted to a pH of 7). 
5 Hybridizations are carried out in the same solution with the following 

modifications: 0.02% PVP, 0.02% Ficoll, 0.2% BSA, 100//g/ml salmon sperm 
DNA, 10% (wt/vol) dextran sulfate, and 5-20 X 10 6 cpm 32 P-labeled probe is 
used. Filters are incubated in hybridization mixture for 18-20 hours at 40°C, 
and then washed for 1.5 hours at 55°C in a solution containing 2X SSC, 25 mM 
10 Tris-HCI (pH 7.4), 5 mM EDTA, and 0.1 % SDS. The wash solution is replaced 
with fresh solution and incubated an additional 1.5 hours at 60°C. Filters are 
blotted dry and exposed for autoradiography. If necessary, filters are washed for 
a third time at 65-68 °C and reexposed to film. Other conditions of low 
stringency which may be used are well known in the art {e.g., as employed for 
15 cross-species hybridizations). 

By way of example and not way of limitation, procedures using 
conditions of moderate stringency is provided. For example, but not limited to, 
procedures using such conditions of moderate stringency are as follows: Filters 
containing DNA are pretreated for 6 hours at 55 °C in a solution containing 6X 
20 SSC, 5X Denhart's solution, 0.5% SDS and 100 /yg/ml denatured salmon sperm 
DNA. Hybridizations are carried out in the same solution and 5-20 X 10 6 cpm 
32 P-labeled probe is used. Filters are incubated in hybridization mixture for 18-20 
hours at 55 °C, and then washed twice for 30 minutes at 60°C in a solution 
containing 1X SSC and 0.1 % SDS. Filters are blotted dry and exposed for 
25 autoradiography. Other conditions of moderate stringency which may be used 
are well-known in the art. Washing of filters is done at 37 °C for 1 hour in a 
solution containing 2X SSC, 0.1% SDS. 

By way of example and not way of limitation, procedures using conditions 
of high stringency are as follows: Prehybridization of filters containing DNA is 
30 carried out for 8 hours to overnight at 65 °C in buffer composed of 6X SSC, 

50 mM Tris-HCI (pH 7.5), 1 mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.02% BSA, 
and 500 ^g/ml denatured salmon sperm DNA. Filters ar hybridized for 48 hours 
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at 65 °C in prehybridization mixture containing 100jt/g/ml denatured salmon 
sperm DNA and 5-20 X 10 6 cpm of 32 P-labeled probe. Washing of filters is done 
at 37°C for 1 hour in a solution containing 2X SSC, 0.01% PVP, 0.01 % Ficoll, 
and 0.01% BSA. This is followed by a wash in 0.1 X SSC at 50°C for 45 
5 minutes before autoradiography. Other conditions of high stringency which may 
be used are well known in the art. 

The term substantially identical or homologous or similar varies with the 
context as understood by those skilled in the relevant art and generally means at 
least 70%, preferably means at least 80%, more preferably at least 90%, and 
10 most preferably at least 95% identity. 

As used herein, substantially identical to a product means sufficiently 
similar so that the property of interest is sufficiently unchanged so that the 
substantially identical product can be used in place of the product. 

As used herein, substantially pure means sufficiently homogeneous to 
15 appear free of readily detectable impurities as determined by standard methods 
of analysis, such as thin layer chromatography (TLC), gel electrophoresis and 
high performance liquid chromatography (HPLC), used by those of skill in the art 
to assess such purity, or sufficiently pure such that further purification would 
not detectably alter the physical and chemical properties, such as enzymatic and 
20 biological activities, of the substance. Methods for purification of the 

compounds to produce substantially chemically pure compounds are known to 
those of skill in the art. A substantially chemically pure compound may, 
however, be a mixture of stereoisomers or isomers. In such instances, further 
purification might increase the specific activity of the compound. 
25 As used herein, target cell refers to a cell that expresses an MTSP in vivo. 

As used herein, test substance refers to a chemically defined compound 
(e.g., organic molecules, inorganic molecules, organic/inorganic molecules, 
proteins, peptides, nucleic acids, oligonucleotides, lipids, polysaccharides, 
saccharides, or hybrids among these molecules such as glycoproteins, etc.) or 
30 mixtures of compounds (e.g., a library of test compounds, natural extracts or 
culture supernatants, etc.) whose effect on an MTSP, particularly a single chain 
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form that includes the protease domain or a sufficient portion thereof for 
activity, as determined by in vitro method, such as the assays provided herein. 

As used herein, the terms a therapeutic agent, therapeutic regimen, 
radioprotectant, chemotherapeutic mean conventional drugs and drug therapies, 
5 including vaccines, which are known to those skilled in the art. Radiotherapeutic 
agents are well known in the art. 

As used herein, treatment means any manner in which the symptoms of a 
conditions, disorder or disease are ameliorated or otherwise beneficially altered. 
Treatment also encompasses any pharmaceutical use of the compositions herein. 

10 As used herein, vector (or plasmid) refers to discrete elements that are 

used to introduce heterologous DNA into cells for either expression or replication 
thereof. The vectors typically remain episomal, but may be designed to effect 
integration of a gene or portion thereof into a chromosome of the genome. Also 
contemplated are vectors that are artificial chromosomes, such as yeast artificial 

15 chromosomes and mammalian artificial chromosomes. Selection and use of such 
vehicles are well known to those of skill in the art. An expression vector 
includes vectors capable of expressing DNA that is operatively linked with 
regulatory sequences, such as promoter regions, that are capable of effecting 
expression of such DNA fragments.. Thus, an expression vector refers to a 

20 recombinant DNA or RNA construct, such as a plasmid, a phage, recombinant 
virus or other vector that, upon introduction into an appropriate host cell, results 
in expression of the cloned DNA. Appropriate expression vectors are well 
known to those of skill in the art and include those that are replicable in 
eukaryotic cells and/or prokaryotic cells and those that remain episomal or those 

25 which integrate into the host cell genome. 

As used herein, protein binding sequence refers to a protein or peptide 
sequence that is capable of specific binding to other protein or peptide 
sequences generally, to a set of protein or peptide sequences or to a particular 
protein or peptide sequence. 

30 As used herein, epitope tag refers to a short stretch of amino acid 

residues corresponding to an epitope to facilitate subsequent biochemical and 
immunological analysis of the epitope tagged protein or peptide. Epitope tagging 
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is achieved by appending the sequence of the epitope tag to the protein- 
encoding sequence in an appropriate expression vector. Epitope tagged proteins 
can be affinity purified using highly specific antibodies raised against the tags. 

As used herein, metal binding sequence refers to a protein or peptide 
5 sequence that is capable of specific binding to metal ions generally, to a set of 
metal ions or to a particular metal ion. 

As used herein, a composition refers to a any mixture. It may be a 

* 

solution, a suspension, liquid, powder, a paste, aqueous, non-aqueous or any 
combination thereof. 

10 As used herein, a combination refers to any association between two or 

among more items. 

As used herein, fluid refers to any composition that can flow. Fluids thus 

encompass compositions that are in the form of semi-solids, pastes, solutions, 

aqueous mixtures, gels, lotions, creams and other such compositions. 
15 As used herein, a cellular extract refers to a preparation or fraction which 

is made from a lysed or disrupted cell. 

'As used herein, an agent is said to be randomly selected when the agent is 
chosen randomly without considering the specific sequences involved in the 
association of a protein alone or with its associated substrates, binding partners, 

20 etc. An example of randomly selected agents is the use a chemical library or a 
peptide combinatorial library, or a growth broth of an organism. 

As used herein, an agent is the to be rationally selected or designed when 
the agent is chosen on a non-random basis which takes into account the 
sequence of the target site and/or its conformation in connection with the 

25 . agent's action. As described in the Examples, there are proposed binding sites 
for serine protease and (catalytic) sites in the protein having SEQ ID NO:3 or 
SEQ ID IM0:4. Agents can be rationally selected or rationally designed by 
utilizing the peptide sequences that make up these sites. For example, a 
rationally selected peptide agent can be a peptide whose amino acid sequence is 

30 identical to the ATP or calmodulin binding sites or domains. 

For clarity of disclosure, and not by way of limitation, the detailed 
description is divided into the subsections that follow. 
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B. MTSP PROTEINS, MUTE1NS, DERIVATIVES AND ANALOGS THEREOF 
MTSPs 

The MTSPs are a family of transmembrane serine proteases that are 
found in mammals and also other species that share a number of common 
5 structural features including: a proteolytic extracellular C-terminal domain; a 

transmembrane domain, with a hydrophobic domain near the N-terminus; a short 
cytoplasmic domain; and a variable length stem region containing modular 
domains. The proteolytic domains share sequence homology including 
conserved his, asp, and ser residues necessary for catalytic activity that are 

10 present in conserved motifs- The MTSPs are synthesized as zymogens, and 
activated to double chain forms by cleavage. It is shown herein that the single 
chain proteolytic domain can function in vitro and, hence is useful in in vitro 
assays for identifying agents that modulate the activity of members of this 
family. Also provided are family members designated MTSP3, MTSP4 and an 

15 MTSP6 variant. 

The MTSP family is a target for therapeutic intervention and also some, 
may serve as diagnostic markers for tumor development, growth and/or 
progression. As discussed, the members of this family are involved in proteolytic 
processes that are implicated in tumor development, growth and/or progression. 

20 This implication is based upon their functions as proteolytic enzymes in 

processes related to ECM degradative pathways. In addition, their levels of 
expression or level of activation or their apparent activity resulting from 
substrate levels or alterations in substrates and levels thereof differs in tumor 
cells and non-tumor cells in the same tissue. Hence, protocols and treatments 

25 that alter their activity, such as their proteolytic activities and roles in signal 
transduction, and/or their expression, such as by contacting them with a 
• compound that modulates their activity and/or expression, could impact tumor 
development, growth and/or progression. Also, in some instances, the level of 
activation and/or expression may be altered in tumors, such as lung carcinoma, 

30 colon adenocarcinoma and ovarian carcinoma. 

The MTSP may serve as a diagnostic marker for tumors. It is shown 
herein, that MTSP3 and MTSP4 and the MTSP6 variant provided herein are 
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expressed and/or activated in certain tumors; hence their activation or 
expression may serve as a diagnostic marker for tumor development, growth 
and/or progression. In other instances the MTSP protein can exhibit altered 
activity by virtue of a change in activity or expression of a co-factor therefor or a 
5 substrate therefor. In addition, in some instances, these MTSPS and/or variants 
thereof may be shed from cell surfaces. Detection of the shed MTSPS, 
particularly the extracellular domains, in body fluids, such as serum, blood, 
saliva, cerebral spinal fluid, synovial fluid and interstitial fluids, urine, sweat and 
other such fluids and secretions, may serve as a diagnostic tumor marker. In 
10 particular, detection of higher levels of such shed polypeptides in a subject 

compared to a subject known not to have any neoplastic disease or compared to 
earlier samples from the same subject, can be indicative of neoplastic disease in 
the subject. 

Provided herein are isolated substantially pure single polypeptides that 

15 contain the protease domain of an MTSP as a single chain. The MTSPs 

contemplated herein are not expressed on endothelial cells, and, preferably, are 
expressed on tumor cells, typically at a level that differs from the level in which 
they are expressed in the non-tumor cell of the same type. Hence, for example, 
if the MTSP is expressed in an ovarian tumor cell, to be of interest herein with 

20 respect to ovarian cancer, it is expressed at the same level in non-tumor ovarian 
cells. MTSP protease domains include the single chain protease domains of 
MTSP1 , MTSP3, MTSP4, MTSP6 and other such proteases, including, but are 
not limited to, corin, enterpeptidase, human airway trypsin-like protease (HAT), 
MTSP1 , TMPRS2, and TMPRSS4. 

25 Provided are the protease domains or proteins that include a portion of an 

MTSP that is the protease domain of any MTSP, particularly an MTSP1 , MTSP3, 
MTSP4 and MTSP6. The protein can also include other non-MTSP sequences of 
amino acids, but will include the protease domain or a sufficient portion thereof 
to exhibit catalytic activity in any in vitro assay that assess such protease 

30 activity, such as any provided herein. 

Also provided herein are nucleic acid molecules that encode MTSP 
proteins and the encoded proteins. In particular, nucleic acid molecules 
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encoding MTSP-3 and MTSP-4 from animals, including splice variants thereof are 
provided. The encoded proteins are also provided. Also provided are functional 
domains thereof/ 

In specific aspects, the MTSP protease domains, portions thereof, and 
5 muteins thereof are from or based on animal MTSPS, including, but are not 
limited to, rodent, such as mouse and rat; fowl, such as chicken; ruminants, 
such as goats, cows, deer, sheep; ovine, such as pigs; and humans. 

In particular, MTSP derivatives can be made by altering their sequences 
by substitutions, additions or deletions that provide for functionally equivalent 
10 molecules. Due to the degeneracy of nucleotide coding sequences, other nucleic 
sequences which encode substantially the same amino acid sequence as a MTSP 
gene can be used. These include but are not limited to nucleotide sequences 
comprising all or portions of MTSP genes that are altered by the substitution of 
different codons that encode the amino acid residue within the sequence, thus 

15 producing a silent change. Likewise, the MTSP derivatives include, but are not 
limited to, those containing, as a primary amino acid sequence, all or part of the 
amino acid sequence of MTSP, including altered sequences in which functionally 
equivalent amino acid residues are substituted for residues within the sequence 
resulting in a silent change. For example, one or more amino acid residues 

20 within the sequence can be substituted by another amino acid of a similar 
polarity which acts as a functional equivalent, resulting in a silent alteration. 
Substitutes for an amino acid within the sequence may be selected from other 
members of the class to which the amino acid belongs. For example, the 
nonpolar (hydrophobic) amino acids include alanine, leucine, isoleucine, valine, 

25 proline, phenylalanine, tryptophan and methionine. The polar neutral amino 
acids include glycine, serine, threonine, cysteine, tyrosine, asparagine, and 
glutamine. The positively charged (basic) amino acids include arginine, lysine 
and histidine. The negatively charged (acidic) amino acids include aspartic acid 
and glutamic acid (see, e.g., Table 1). 

30 In a preferred embodiment, the substantially purified MTSP protease is 

encoded by a nucleic acid that hybridizes to the a nucleic acid molecule 
containing the protease domain encoded by the nucleotide sequence set forth in 
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any of SEQ. ID Nos. 1, 3, 5 f 7, 9 or 1 1 under at least moderate, generally high, 
stringency conditions, such that the protease domain encoding nucleic acid 
thereof hybridizes along its full length. In preferred embodiments the 
substantially purified MTSP protease is a single chain polypeptide that includes 
5 substantially the sequence of amino acids set forth in any SEQ ID Nos. 2, 4, 6, 
8, 10 and 12 that encodes the protease domain. Specific 
sequences for the following human MTSPs and domains thereof are provided as 
follows: SEQ ID No. 3 MTSP3 nucleic acid sequence; SEQ ID No. 4 MTSP3 
amino acid sequence; SEQ ID No. 5 MTSP4 nucleic acid encoding the protease 
10 domain; SEQ ID No. 6 MTSP4 amino acid sequence of the protease domain; SEQ 
ID No. 7 MTSP4-L nucleic acid sequence; SEQ ID No. 8 MTSP4-L amino acid 
sequence; SEQ ID No. 9 MTSP4-S nucleic acid sequence; SEQ ID No. 10 
MTSP4-S amino acid sequence; SEQ ID No. 1 1 MTSP6 nucleic acid sequence; 
SEQ ID No. 12 MTSP6 amino acid sequence. SEQ ID No. 1 sets forth the nucleic 
15 acid sequence of the long form of MTSP1 ; SEQ ID No. 2 the encoded amino acid 
sequence; SEQ ID No. 49 sets forth the sequence of a protease domain of an 
MTSP1, and SEQ ID No. 50 the sequence of the encoded single chain protease 
domain thereof. Figures 1-3 depict the structural organization of the MTSP3, 
MTSP4 and MTSP6, respectively. 
20 ,n Particular, exemplary protease domains include at least a sufficient 

portion of sequences of amino acids set forth as amino acids 615-855 in SEQ ID 
No. 2 (encoded by nucleotides 1865-2587 in SEQ ID No. 1; see also SEQ ID 
Nos. 49 and 50) from MTSP1 (matriptase), amino acids 205-437 of SEQ ID NO. 
4 from MTSP3, SEQ ID No. 6, which sets forth the protease domain of MTSP4, 
25 and amino acids 217-443 of SEQ ID No. 1 1 from MTSP6. Also 

contemplated are nucleic acid molecules that encode a single chain MTSP 
protease that have proteolytic activity in an in vitro proteolysis assay and that 
have at least 60%, 70%, 80%, 85%, 90% or 95% sequence identity with the 
full length of a protease domain of an MTSP protein, or that hybridize along their 
30 full length to a nucleic acids that encode a protease domain, particularly under 
conditions of moderate, generally high, stringency. As above, the encoded 
polypeptides contain the protease as a single chain. 
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The isolated nucleic acids may include of at least 8 nucleotides of an 
MTSP sequence. In other embodiments, the nucleic acids may contain least 25 
(continuous) nucleotides, 50 nucleotides, 100 nucleotides, 1 50 nucleotides, or 
.200 nucleotides of a MTSP sequence, or a full-length MTSP coding sequence. In 
5 another embodiment, the nucleic acids are smaller than 35, 200 or 500 

nucleotides in length. Nucleic acids can be single or double stranded. Nucleic 
acids that hybridizes to or complementary to the foregoing sequences, in 
particular the inverse complement to nucleic acids that hybridizes to the 
foregoing sequences {i.e., the inverse complement of a nucleic acid strand has 

10 the complementary sequence running in reverse orientation to the strand so that 
the inverse complement would hybridize without mismatches to the nucleic acid 
strand; thus, for example, where the coding strand is that hybridizes to a nucleic 
acid with no mismatches between the coding strand and the that hybridizes 
strand, then the inverse complement of the that hybridizes strand is identical to 

15 the coding strand) are also provided. In specific aspects, nucleic acids are 

provided that include a sequence complementary to (specifically are the inverse 
complement of) at least 10, 25, 50, 100, or 200 nucleotides or the entire coding 
region of an MTSP encoding nucleic acid, particularly the protease domain 
thereof. For MTSP3 and MTSP4 the full-length protein or domain or active 

20 fragment thereof. 

For each of the nucleic acid molecules, the nucleic acid can be DNA or 
RNA or PNA or other nucleic acid analogs or may include non-natural nucleotide 
bases. 

Also provided are isolated nucleic acid molecules that include a sequence 
25 of nucleotides complementary to the nucleotide sequence encoding an MTSP. 

Probes and primers derived from the nucleic acid molecules are provided, 
Such probes and primers contain at least 8, 14, 16, 30, 100 or more contiguous 
nucleotides with identity to contiguous nucleotides of an MTSP, including, but 
are not limited to, MTSP1 , MTSP3, MTSP4 and MTSP6. The probes and primers 
30 are optionally labelled with a detectable label, such as a radiolabel or a 
fluorescent tag, or can be mass differentiated for detection by mass 
spectrometry or other means. 
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Plasmids and vectors containing the nucleic acid molecules are also 
provided. Cells containing the vectors, including cells that express the encoded 
proteins are provided. The cell can be a bacterial cell, a yeast cell, a fungal cell, 
a plant cell, an insect cell or an animal cell. Methods for producing an MTSP or 
5 single chain form of the protease domain thereof by, for example, growing the 
cell under conditions whereby the encoded MTSP is expressed by the cell, and 
recovering the expressed protein, are provided herein. As noted, for MTSP3 and 
MTSP4, the fuU-length zymogens and activated proteins and activated {two 
strand) protease and single chain protease domains are provided. 

10 Except for the MTSP proteins (MTSP3 and MTSP4) heretofore 

unidentified and provided herein, the isolated polypeptides contain the MTSP 
protease domain or a catalyticalfy active portion thereof and, generally, do not 
contain additional MTSP. Hence isolated, substantially pure proteases, protease 
domains or catalytically active portion thereof in single chain form of MTSPs are 

15 provided. The protease domains may be included in a longer protein, but such 
longer protein is not the MTSP zymogen. 

Thus, MTSP3 and MTSP4 proteins are provided. For these proteins, the 

4 

domains, fragments, derivatives or analogs that are functionally active, Le., 
capable of exhibiting one or more functional activities associated with the MTSP 

20 protein, e.g., serine protease activity, immunogenicity and antigenicity, are 

provided. As discussed above, the protease domains thereof are also provided. 
For MTSP3 and MTSP4, the zymogens and activated forms, and also, the single 
chain and double chain, activated protease domains are provided. 

Also provided are nucleic acid molecules that hybridize to the above- 

25 noted sequences of nucleotides encoding MTSP3 and MTSP4 (SEQ ID Nos. 3, 5, 
7 and 9) at least at low stringency, more preferably at moderate stringency, and 
most preferably at high stringency, and that encode the protease domain and/or 
the full length protein or other domains of an MTSP family member, such as 
MTSP3, MTSP4, MTSP6 or a splice variant or allelic variant thereof, or MTSP6 

30 or a splice variant or allelic variant thereof. Preferably the molecules hybridize 
under such conditions along their full length for at least one domain and encode 
at least one domain, such as the protease or extracellular domain, of the 
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polypeptide. In particular, such nucleic acid molecules include any isolated 
nucleic fragment that encodes at least one domain of a membrane serine 
protease, that (1) contains a sequence of nucleotides that encodes the protease 
or a domain thereof, and (2) is selected from among: 
5 (a) a sequence of nucleotides that encodes the protease or a domain 

thereof includes a sequence of nucleotides set forth above; 

(b) a sequence of nucleotides that encodes such portion or the full 
length protease and hybridizes under conditions of high stringency, 
preferably to nucleic acid that is complementary to a mRNA 

10 transcript present in a mammalian cell that encodes such protein 

or fragment thereof; 

(c) a sequence of nucleotides that encodes a transmembrane protease 
or domain thereof that includes a sequence of amino acids 
encoded by such portion or the full length open reading frame; and 

15 (d) a sequence of nucleotides that encodes the transmembrane 

protease that includes a sequence of amino acids encoded by a 
sequence of nucleotides that encodes such subunit and hybridizes 
under conditions of high stringency to DNA that is complementary 
to the mRNA transcript. 
20 Exemplary MTSPs 

The above discussion provides an overview and some details of the 
exemplified MTSPs. The following discussion provides additional details (see, 
also, EXAMPLES). 

MTSP1 (matriptase) 

25 Matriptase is a trypsin-like serine protease with broad spectrum cleavage 

activity and two potential regulatory modules. It was named "matriptase" 
because its ability to degrade the extra-cellular matrix and its trypsin-like activity. 
When isolated from breast cancer cells (or T-47D cell conditioned medium), 
matriptase has been reported to be primarily in an uncomplexed form. 

30 Matriptase has been isolated from human milk; when isolated from human milk, 
matriptase was reported to be in one of two complexed forms, 95 kDa (the 
predominant form) and 110 kDa; uncomplexed matriptase was not detected. 
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(Liu, etal.,J. Biol. Chem. 274 (26): 1 8237-1 8242 (1999).) It has been proposed 
that matriptase exists as an uncomplexed protease when in its active state. In 
breast milk, matriptase has been reported to exist in complex with a fragment of 
hepatocyte growth factor inhibitor-1 (HAI-1), a Kuntz-type serine protease 
5 inhibitor having activity against trypsin-like serine proteases. 

Ecotin and Ecotin M84R/M85R are macromolecular inhibitors of serine 
proteases of the chymotrypsin fold and inhibit ductal branching, morphogenesis 
and differentiation of the explanted ductal prostate. PC-3 is a cell line derived 
from prostate cancer epithelial cells. Ecotin and M84R/M85R ecotin were found 

10 to decrease tumor size and metastasis in PC-3 implanted nude mice. 

Matriptase has been isolated and its encoding nucleic acids cloned from 
T-47D human breast cancer cell-conditioned medium (Lin et al. (1999) J. Biol. 
Chem, 274:18231-18236). Upon analysis of the cDNA, it was determined that 
the full length protease has 683 amino acids and contains three main structural 

15 regions: a serine protease domain near the carboxyl-terminal region, four 

tandem low-density lipoprotein receptor domains, and two tandem complement 
subcomponents Clrand C1s. 

Studies to identify additional serine proteases made by cancer cells were 
done using PC-3 cells. A serine protease termed "MT-SP1 ", reported to be a 

20 transmembrane protease was cloned (Takeuchi eta/. (1999) Proc. Natl. Acad. 
Sci. U.S.A. 96*A 1054-1 1061). It was subsequently found the originally 
identified matriptase sequence is included in the translated sequence of the 
cDNA that encodes MT-SP1 . The matriptase cDNA was reported to be a partial 
MT-SP1 cDNA and to lack 516 of the coding nucleotides (Takeuchi, et aL, J. 

25 Biol. Chem 275:26333-26342 (2000).) Since the reported matriptase encoding 
cDNA sequence encoded a possible initiating methionine, it was proposed that 
alternative splicing could yield a protein lacking the N-terminal region of MTSP1 . 

Matriptase and MT-SP1 demonstrate trypsin-Jike protease activity and are 
30 Type II transmembrane proteins with a common extracellular protease domain. 
Studies of substrate specificity of MT-SP1 reveal that protease-activated 
receptor 2 (PAR2) and single-chain urokinase-type plasminogen activator (sc- 
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uPA) are macromolecular substrates of MT-SP1 . PAR2 is functions in 
inflammation, cytoprotection and/or cell adhesion, while sc-uPa is functions in 
tumor cell invasion and metastasis. 

An exemplary nucleotide sequences encoding a human MTSP1 is set 
5 forth in SEQ ID Nos 1 and 2 (see also SEQ ID Nos. 49 and 50 for the protease 
domain thereof). As previously noted SEQ ID No. 1 sets for an MTSP1 -encoding 
nucleic acid sequence. This sequence is the longer version and includes the 
protease domain, which is common to both variants Nucleic acids encoding the 
MTSP that hybridizes to the nucleotide sequence set forth in SEQ ID No. 1 can 
10 be obtained by any method known in the art, e.g, by PCR amplification using 

synthetic primers that hybridizes to the 3' and 5' ends of the sequence and/or by 
cloning from a cDNA or genomic library using a PCR amplification product or an 
oligonucleotide specific for the gene sequence {e.g., as described in Section C 
herein). Homologs (e.g., nucleic acids of the above-listed genes of species other 
15 than human) or other related sequences (e.g., paralogs) and muteins can be 

obtained by low, moderate or high stringency hybridization with all or a portion 
of the particular sequence provided as a probe using methods well known in the 
art for nucleic acid hybridization and cloning. 

Isolated single chain protease domains of MTSP1 proteins from animals 
20 are provided herein. As shown herein, the single chain protease domain is 
catalytically active and can be used in a variety of drug screening assays, 
particularly in vitro proteolytic assays. Exemplary MTSP protease domains are 
set forth as the amino acids (615-855 of SEQ ID No. 2) encoded by nucleotides 
1865-2587 of SEQ ID No. 1 (see, also, SEQ ID Nos. 49 and 50). The MTSP1 
25 single chain protease domain is catalytically active 

Muteins of the MTSP1 proteins are provided. In the activated double 
chain molecule, residue 731 forms a disulfide bond with the Cys at residue 604. 
In the single chain form, the residue at 731 in the protease domain is free. 
Muteins in which Cys residues, particularly the free Cys residue (amino acid 731 
30 in SEQ ID No. 2) in the single chain protease domain are provided. Other 

muteins in which conservative amino acids replacements are effected and that 
retain proteolytic activity as a single chain are also provided. Such changes may 
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be systematically introduced and tested for activity in in vitro assays, such as 
those provided herein. 

MTSP3 

In a specific embodiment, a nucleic acid that encodes a MTSP, 
5 designated MTSP3 is provided. In particular the nucleic acid includes an open 
reading frame within the following sequence of nucleotides set forth in SEQ ID 
No. 3. In particular the protein is encoded by the open reading frame that begins 
at nucleotide 261 and ends at 1 574. 

Also provided are nucleic acid molecules that hybridize under conditions 
10 of at least low stringency, preferably moderate stringency, more preferably high 
stringency to the following sequence of nucleic acids (SEQ ID No. 3), particularly 
to the open reading frame encompassed by nucleotides that encode a single 
protease domain thereof, or any domain of MTSP3 

Also included are substantially purified MTSP3 zymogen, activated double 
15 chains, single chain protease domains and double chain protease domains. 

These are encoded by a nucleic acid that includes sequence encoding a protease 
domain that exhibits proteolytic activity and that hybridizes to a nucleic acid 
molecule having a nucleotide sequence set forth in SEQ ID No. 3, typically under 
moderate, generally under high stringency, conditions and most preferably along 
20 the fulf length of the protease domain. Splice variants are also contemplated 
herein. 

In a preferred embodiment, the isolated nucleic acid fragment hybridizes 
to the nucleic acid having the nucleotide sequence set forth in SEQ ID No: 3 
under high stringency conditions, and preferably comprises the sequence of 

25 nucleotides set forth in any of SEQ ID Nos. 3 or comprises a portion thereof that 
encodes a transmembrane domain and may additionally include a LDLR domain, 
a scavenger-receptor cysteine rich (SRCR) domain and a serine protease catalytic 
domain or any other identified domain (see FIGURES) or comprises nucleic acid 
molecule that encodes the protein encoded by SEQ ID NO. 4. 

30 The isolated nucleic acid fragment is DNA, including genomic or cDNA, or 

is RNA, or can include other components, such as protein nucleic acid. The 
isolated nucleic acid may include additional components, such as heterologous or 
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native promoters, and other transcriptional and translationa) regulatory 
sequences, these genes may be linked to other genes, such as reporter genes or 
other indicator genes or genes that encode indicators. 

Also provided is an isolated nucleic acid molecule that includes the 
5 sequence of molecules that is complementary to the nucleotide sequence 
encoding the MTSP or the portion thereof. 

Also provided are fragments thereof that can be used as probes or 
primers and that contain at least about 10 nucleotides, more preferably 14 
nucleotides, more preferably at least about 16 nucleotides, most preferably at 
10 least about 30 nucleotides. 

Hence provided herein are polypeptides that are encoded by such nucleic 
acid molecules. Included among those polypeptides are the MTSP3 protease 
domain or a polypeptide with conservative amino acid changes such that the 
specificity and protease activity remains substantially unchange. In particular, a 
1 5 substantially purified mammalian MTSP protein is provided that has a 

transmembrane domain and may additionally include a CUB domain, one or more 
of an LDLR domain(s), a scavenger-receptor cysteine rich (SRCR) domain and a 
serine protease catalytic domain is provided. 

Also provided is a substantially purified protein comprising a sequence of 
20 amino acids that has at least 60%, more preferably at least about 90%, most 
preferably at least about 95%, identity to the MTSP3, wherein the percentage 
identity is determined using standard algorithms and gap penalties that maximize 
the percentage identity. The human MTSP3 protein is most preferred, although 
other mammalian MTSP3 proteins are contemplated. 
25 Muteins of MTSP3, particularly those in which Cys residues, such as the 

Cys310 in the single chain protease domain, is replaced with another amino acid 
that does not eliminate the activity, are provided. 

MTSP4 

Among the proteins provided herein is MTSP4. MTSP4 is highly 
30 expressed in the liver, and is expressed in substantially lower levels in other 
tissues (see, EXAMPLES). It is also expressed in non-liver-derived tumors (see 
EXAMPLES), including Burkitt's lymphoma, colorectal adenocarcinoma 



WO 01/57194 



PCT/US01/03471 



-55- 

(SW480), lung carcinoma (A549), and in leukemic cells, indicating a role in one 
or more of tumor progression, tumor invasion, tumor growth and tumor 
metastases. 

Also provided are nucleic acid molecules that hybridize under conditions 
5 of at least low stringency, preferably moderate stringency, more preferably high 
stringency to the sequence of nucleic acids set forth in SEQ ID Nos. 5, 7 or 9), 
particularly to the open reading frame encompassed by nucleotides that encode a 
single protease domain thereof, or any domain of an MTSP4. 

Also included are substantially purified MTSP4 zymogens, activated 

10 double chains, single chain protease domains and double chain protease 

domains. These are encoded by a nucleic acid that includes sequence encoding 
a protease domain that exhibits proteolytic activity and that hybridizes to a 
nucleic acid molecule having a nucleotide sequence set forth in SEQ ID Nos. 5, 7 
and 9, typically under moderate, generally under high stringency, conditions and 

15 most preferably along the full length of the protease domain. 

In a preferred embodiment, the isolated nucleic acid fragment hybridizes 
to the nucleic acid having the nucleotide sequence set forth in SEQ ID No: 5, 7 
or 9 under high stringency conditions, and preferably comprises the sequence of 
nucleotides set forth in any of SEQ ID Nos. 5, 7 or 9 comprises a portion thereof 

20 that encodes a transmembrane domain and may additionally include a LDLR 

domain, a scavenger-receptor cysteine rich (SRCR) domain and a serine protease 
catalytic domain or any other identified domain (see FIGURES) or comprises 
nucleic acid molecule that encodes the protein encoded by SEQ ID NO. 6, 9 or 
10.. 

25 The isolated nucleic acid fragment is DNA, including genomic or cDNA, or 

is RNA, or can include other components, such as protein nucleic acid. The 
isolated nucleic acid may include additional components, such as heterologous or 
native promoters, and other transcriptional and translations! regulatory 
sequences, these genes may be linked to other genes, such as reporter genes or 

30 other indicator genes or genes that encode indicators. 
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Also provided is an isolated nucleic acid molecule that includes the 
sequence of molecules that is complementary to the nucleotide sequence 
encoding and MTSP4 or the portion thereof. 

Also provided are fragments thereof that can be used as probes or 
5 primers and that contain at least about 10 nucleotides, more preferably 14 
nucleotides, more preferably at least about 1 6 nucleotides, most preferably at 
least about 30 nucleotides. 

In particular nucleic acid molecules encoding two forms of MTSP4 are 
provide. The encoded proteins are multi-domain, type II membrane-type serine 
10 proteases and include a transmembrane domain at the N terminus followed by a 
CUB domain, 3 LDLR domains and a trypsin-like serine protease domain at the C 
terminus. The difference between the two forms, which are splice variants, is 
the absence in MTSP4-S of a 432-bp nucleotide sequence between the 
transmembrane and the CUB domains (see FIGURE 2; see, also SEQ ID Nos. 5- 
15 10). 

Also provided is a nucleic acid that encodes the extracellular protease 
domain of an MTSP4 is provided. Both forms of MTSP4 exemplified herein 
include a protease domain in common (see SEQ ID Nos. 5 and 6). 

In particular, the extracellular protease domain of the MTSP4 proteins is 

20 encoded by the open reading frame that begins at nucleotide 1 and ends at 708 
(TGA) (SEQ ID No. 5. This open reading frame encodes a portion of the MSTP4 
protein and includes the protease domain. Full length MSTP4 proteins (SEQ ID 
Nos. 7 and 9) include the above domain. The extracellular protease domain, as 
a single chain, and also an activated double chain, exhibit protease activity. The 

25 disulfide bonds that form that two chain form of MTSP forms are likely between 
Cys415 and Cys535 for MTSP4-S, and between Cys559 and Cys679 for 
MTSP4-L. 

For use of the single chain protease domain thereof, it is of interest to 
replace the free Cys (i.e. Cys535 (Cys679)) in the protease domain with another 
30 amino acid, such as any amino acid that does not alter the function (such 
change is likely to be any amino acid). Thus, muteins of MTSP4, particularly 



WO 01/57194 



PCT/US01/03471 



-57- 

those in which Cys residues, such as the Cys535 and Cys679 in the single chain 
protease domains of MTSP4-S and MTSP4-L, respectively, are provided. 

MTSP6 

Nucleic acid and the encoded MTSP6 protein of an exemplary MTSP6 are 
5 also provided. The respective sequences are set forth in SEQ ID Nos. 1 1 and 
12. The MTSP6 DNA and protein sequences were analyzed using DNA Strider 
{version 1.2). The ORF encoding the MTSP6 variant provided herein is 
composed of 1 ,362 bp, which translate into a 453-amino acid protein. MTSP6 
is a multi-domain, type-ll membrane-type serine protease containing a 

10 transmembrane domain (amino acids 48-68) at the N-terminus followed by a 

LDLRa domain (LDL receptor domain class a) (amino acids 72-108), a SR domain 
(Scavenger receptor Cys-rich domain)(amino acids 109-205), and a trypsin-like 
serine protease domain (amino acids 216-443) (see FIGURE 3). Muteins of 
MTSP6, particularly those in which Cys residues, such as the Cys324 in the 

15 single chain protease domain of MTSP6 are provided. 

International PCT application No. WO 00/52044 describes MTSPs that 
resemble the MTSP6 provided herein. The polypeptide provided therein differs at 
single amino acid positions, such as 90 in SEQ ID No. 1 2 (Ala is replaced with a 
Thr), and significantly from the instant MTSP6 in that ten amino acids (amino 

20 acid nos. 46-55 in SEQ ID No. 12) are replaced with the eleven amino acids: 

phe glu val phe ser gin ser ser ser leu gly (SEQ ID No. 59) resulting in a protein 
that is one 454 amino acids long. 

There are a few other amino acid sequence differences and a number of 
nucleic acid sequence differences. Significantly, there are substantial differences 

25 in the protease domain at amino acids 368-394 (368 

ICNHRDVYGGIISPSMLCAGYLTGGVD 394; SEQ ID No. 12) are replaced at 

position 369-396 with animo acids: 

369 DLQPQ — GRVRWHHLPLHALRGLPDGWRWN 396, where the differences 
from 368-394 (Seq ID No. 12) are indicated. 
30 In addition, a second C-terminus truncated variant with an altered 

protease domain is identified in the PCT application. The variant is the same as 
the 454 variant through amino acid 261 thereof (corresponding to 160 of SEQ 
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ID No. 1 2 herein), followed by 33 amino acids (see SEQ ID No. 60 herein) that 
differ by virtue of a frame shift. 

C. Tumor specificity and tissue expression profiles 

Each MTSP has a characteristic tissue expression profile; the MTSPs in 
5 particular, although not exclusively expressed or activated in tumors, exhibit 
characteristic tumor tissue expression or activation profiles. In some instances, 
MTSPs may have different activity in a tumor cell from a non-tumor cell by virtue 
of a change in a substrate or cofactor thereof or other factor that would alter the 
apparent functional activity of the MTSP. Hence each can serve as a diagnostic 

10 marker for particular tumors, by virtue of a level of activity and/or expression or 
function in a subject (i.e. a mammal, particularly a human) with neoplastic 
disease, compared to a subject or subjects that do not have the neoplastic 
disease. In addition, detection of activity (and/or expression) in a particular 
tissue can be indicative of neoplastic disease. Shed MTSPs in body fluids can 

15 be indicative of neoplastic disease. Also, by virtue of the activity and/or 

expression profiles of each, they can serve as therapeutic targets, such as by 
administration of modulators of the activity thereof, or, as by administration of a 
prodrug specifically activated by one of the MTSPs. 

Tissue expression profiles 

20 MTSP3 

The MTSP3 transcript was detected in lung carcinoma (LX-1), colon 
adenocarcinoma (CX-1), colon adenocarcinoma (GI-112) and ovarian carcinoma 
(GI-102). No apparent signal was detected in another form of lung carcinoma 
(GI-117), breast carcinoma (GI-101), pancreatic adenocarcinoma (GI-103) and 

25 prostatic adenocarcinoma (PC3). 

MTSP1 is expressed in breast cancers. 

MTSP4 

The MTSP4 transcript, a DNA fragment encoding part of the LDL receptor 
domain and the protease domain was used to probe an RNA blot composed of 
30 76 different human tissues (catalog number 7775-1; human multiple tissue 

expression (MTE) array; CLONTECH). As in the northern analysis of gel blot, a 
very strong signal was observed in the liver. Signals in other tissues were 
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observed in (decreasing signal level): fetal liver > heart = kidney = adrenal 
gland = testis = fetal heart and kidney = skeletal muscle = bladder = placenta 
> brain — spinal cord = colon = stomach = spleen = lymph node = bone 
marrow = trachea = uterus = pancreas = salivary gland = mammary gland = 
5 lung. MTSP4 is also expressed less abundantly in several tumor cell lines 

including HeLa S3 = leukemia K-562 = Burkitt's lymphomas (Raji and Daudi) = 
colorectal adenocarcinoma (SW480) > lung carcinoma (A54-9) = leukemia 
MOLT-4 = leukemia HL-60. PCR of the MTSP4 transcript from cDNA libraries 
made from several human primary tumors xenografted in nude mice (human 

10 tumor multiple tissue cDNA panel, catalog number K1 522-1, CLOIMTECH) was 

performed using MTSP4-specific primers. The MTSP4 transcript was detected in 
breast carcinoma (GI-1 01), lung carcinoma (LX-1), colon adenocarcinoma 
(GI-1 12) and pancreatic adenocarcinoma (GI-1 03). No apparent signal was 
detected in another form of lung carcinoma (GI-1 1 7), colon adenocarcinoma 

15 (CX-1), ovarian carcinoma (GI-1 02) and prostatic adenocarcinoma (PC3). The 

MTSP4 transcript was also detected in LNCaP and PC-3 prostate cancer cell 

lines as well as in HT-1080 human fibrosarcoma cell line. 

Gene expression profile of MTSP6 in normal and tumor 
tissues 

20 To obtain information regarding the gene expression profile of the MTSP6 

transcript, a 495 bp DNA fragment obtained from PCR reaction with primers 
CM7-NSP-3 and NSP-4AS was used to probe an RNA blot composed of 76 
different human tissues (catalog number 7775-1; human multiple tissue 
expression (MTE) array; CLONTECH). The strongest signal was observed in 

25 duodenum. Signal in other tissues were observed in (decreased signal level): 
Stomach > trachea = mammary gland = thyroid gland = salivary gland = 
pituitary gland = pancreas > kidney > lung > jejunum = ileum - ilocecum = 
appendix = fetal kidney > fetal lung. Very weak signals can also be detected 
in several other tissues. 

30 MTSP6 is also expressed in several tumor cell lines including HeLa S3 > 

colorectal adenocarcinoma (SW480) > leukemia MOLT-4 > leukemia K-562. 
PCR analysis of the MTSP6 transcript from cDNA libraries made from several 
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human primary tumors xenografted in nude mice (human tumor multiple tissue 
cDNA panel, catalog number K1 522-1, CLONTECH) was performed using 
MTSP6-specific primers (CM7-NSP-3 and Chi 7-NSP2AS). The MTSP6 
transcript was strongly detected in lung carcinoma (LX-1), moderately detected 
5 in pancreatic adenocarcinoma (GI-103), weakly detected in ovarian carcinoma 
(GI-102); and very weakly detected in colon adenocarcinoma (GI-1 12 and CX-1), 
breast carcinoma (GI-1 01), lung carcinoma (GI-1 17) and prostatic 
adenocarcinoma (PC3). The MTSP6 transcript was also detected in breast 
cancer cell line MDA-MB-231 , prostate cancer cell line PC-3, but not in HT-1080 
10 human fibrosarcoma cell line. MTSP6 is also expressed in mammary gland 
carcinoma cDNA (Clontech). MTSP6 is also over expressed in ovarian tumor 
cells. 

D. Identification and isolation of MTSP protein genes 

The MTSP proteins, or domains thereof, can be obtained by methods well 

15 known in the art for protein purification and recombinant protein expression. 
Any method known to those of skill in the art for identification of nucleic acids 
that encode desired genes may be used. Any method available in the art can be 
used to obtain a full length (i.e., encompassing the entire coding region) cDNA or 
genomic DNA clone encoding an MTSP protein. In particular, the polymerase 

20 chain reaction (PCR) can be used to amplify a sequence identified as being 

differentially expressed in normal and tumor celts or tissues, e.g., nucleic acids 
encoding an MTSP protein (SEQ. NOs: 1-12), in a genomic or cDNA library. 
Oligonucleotide primers that hybridize to sequences at the 3' and 5' termini of 
the identified sequences can be used as primers to amplify by PCR sequences 

25 from a nucleic acid sample (RNA or DNA), preferably a cDNA library, from an 
appropriate source (e.g., tumor or cancer tissue). 

PCR can be carried out, e.g., by use of a Perkin-Elmer Cetus thermal 
cycler and Taq polymerase (Gene Amp"). The DNA being amplified can include 
mRNA or cDNA or genomic DNA from any eukaryotic species. One can choose 

30 to synthesize several different degenerate primers, for use in the PCR reactions. 
It is also possible to vary the stringency of hybridization conditions used in 
priming the PCR reactions, to amplify nucleic acid homologs (e.g., to obtain 
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MTSP protein sequences from species other than humans or to obtain human 
sequences with homology to MTSP protein) by allowing for greater or lesser 
degrees of nucleotide sequence similarity between the known nucleotide 
sequence and the nucleic acid homolog being isolated. For cross species 
5 hybridization, low stringency conditions are preferred. For same species 

w. 

hybridization, moderately stringent conditions are preferred. After successful 
amplification of the nucleic acid containing all or a portion of the identified MTSP 
protein sequence or of a nucleic acid encoding all or a portion of an MTSP 
protein homolog, that segment may be molecularly cloned and sequenced, and 

10 used as a probe to isolate a complete cDNA or genomic clone. This, in turn, will 
permit the determination of the gene's complete nucleotide sequence, the 
analysis of its expression, and the production of its protein product for functional 
analysis. Once the nucleotide sequence is determined, an open reading frame 
encoding the MTSP protein gene protein product can be determined by any 

15 method well known in the art for determining open reading frames, for example, 
using publicly available computer programs for nucleotide sequence analysis. 
Once an open reading frame is defined, it is routine to determine the amino acid 
sequence of the protein encoded by tfie open reading frame. In this way, the 
nucleotide sequences of the entire MTSP protein genes as well as the amino acid 

20 sequences of MTSP protein proteins and analogs may be identified. 

Any eukaryotic cell potentially can serve as the nucleic acid source for 
the molecular cloning of the MTSP protein gene. The nucleic acids can be 
isolated from vertebrate, mammalian, human, porcine, bovine, feline, avian, 
equine, canine, as well as additional primate sources, insects, plants, etc. The 

25 DNA may be obtained by standard procedures known in the art from cloned DNA 
(e.g., a DNA "library"), by chemical synthesis, by cDNA cloning, or by the 
cloning of genomic DNA, or fragments thereof, purified from the desired cell 
(see, for example, Sambrook et al., 1989, Molecular Cloning, A Laboratory 
Manual, 2d Ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New 

30 York; Glover, D.M. (ed.), 1985, DNA Cloning: A Practical Approach, MRL Press, 
Ltd., Oxford, U.K. Vol. I, II). Clones derived from genomic DNA may contain 
regulatory and intron DNA regions in addition to coding regions; clones derived 
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from cDNA will contain only exon sequences. Whatever the source, the gene 
should be molecularly cloned into a suitable vector for propagation of the gene. 

In the molecular cloning of the gene from genomic DNA, DNA fragments 
are generated, some of which will encode the desired gene. The DNA may be 
5 cleaved at specific sites using various restriction enzymes. Alternatively, one 
may use DNAse in the presence of manganese to fragment the DNA, or the DNA 
can be physically sheared, for example, by sonication. The linear DNA 
fragments can then be separated according to size by standard techniques, 
including but not limited to, agarose and polyacrylamide gel electrophoresis and 

10 column chromatography. 

Once the DNA fragments are generated, identification of the specific DNA 
fragment containing the desired gene may be accomplished in a number of 
ways. For example, a portion of the MTSP protein (of any species) gene (e.g., a 
PCR amplification product obtained as described above or an oligonucleotide 

15 having a sequence of a portion of the known nucleotide sequence) or its specific 
RNA, or a fragment thereof be purified and labeled, and the generated DNA 
. fragments may be screened by nucleic acid hybridization to the labeled probe 
(Benton and Davis, Science 1 96 :180 (1977); Grunstein and Hogness, Proc. Natl. 
Acad. ScL U.S.A. 72:3961 (1975)). Those DNA fragments with substantial 

20 homology to the probe will hybridize. It is also possible to identify the 

appropriate fragment by restriction enzyme digestion(s) and comparison of 
fragment sizes with those expected according to a known restriction map if such 
is available or by DNA sequence analysis and comparison to the known 
nucleotide sequence of MTSP protein. Further selection can be carried out on 

25 the basis of the properties of the gene. Alternatively, the presence of the gene 
may be detected by assays based on the physical, chemical, or immunological 
properties of its expressed product. For example, cDNA clones, or DNA clones 
which hybrid-select the proper mRNA, can be selected which produce a protein 
that, e.g., has similar or identical electrophoretic migration, isolectric focusing 

30 behavior, proteolytic digestion maps, antigenic properties, serine protease 
activity. If an anti-MTSP protein antibody is available, the protein may be 
identified by binding of labeled antibody to the putatively MTSP protein 
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synthesizing clones, in an EL1SA (enzyme-linked immunosorbent assay)-type 
procedure. 

Alternatives to isolating the MTSP protein genomic DNA include, but are 
not limited to, chemically synthesizing the gene sequence from a known 
5 sequence or making cDNA to the mRNA that encodes the MTSP protein. For 
example, RNA for cDNA cloning of the MTSP protein gene can be isolated from 
cells expressing the protein. The identified and isolated nucleic acids can then 
be inserted into an appropriate cloning vector. A large number of vector-host 
systems known in the art may be used. Possible vectors include, but are not 
10 limited to, plasmids or modified viruses, but the vector system must be 

compatible with the host cell used. Such vectors include, but are not limited to, 
bacteriophages such as lambda derivatives, or plasmids such as pBR322 or pUC 
plasmid derivatives or the Bluescript vector (Stratagene, La Jolla, CA). The 
insertion into a cloning vector can, for example, be accomplished by ligating the 

15 DNA fragment into a cloning vector which has complementary cohesive termini. 
I the complementary restriction sites used to fragment the DNA are not present 
in the cloning vector, the ends of the DNA molecules may be enzymatically 
modified. Alternatively, any site desired may be produced by ligating nucleotide 
sequences (linkers) onto the DNA termini; these ligated linkers may comprise 

20 specific chemically synthesized oligonucleotides encoding restriction 

endonuclease recognition sequences. In an alternative method, the cleaved 
vector and MTSP protein gene may be modified by homopolymeric tailing. 
Recombinant molecules can be introduced into host cells via transformation, 
transfection, infection, electroporation, etc., so that many copies of the gene 

25 sequence are generated. 

In an alternative method, the desired gene may be identified and isolated 
after insertion into a suitable cloning vector in a "shot gun" approach. 
Enrichment for the desired gene, for example, by size fractionization, can be 
done before insertion into the cloning vector. 

30 In specific embodiments, transformation of host cells with recombinant 

DNA molecules that incorporate the isolated MTSP protein gene, cDNA, or 
synthesized DNA sequence enables generation of multiple copies of th gene. 
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Thus, the gene may be obtained in large quantities by growing transformants, 

isolating the recombinant DNA molecules from the transformants and, when 

necessary, retrieving the inserted gene from the isolated recombinant DNA. 

E. Vectors, plasmids and cells that contain nucleic acids encoding an MTSP 
5 protein or protease domain thereof and expression of MTSP proteins 

Vectors and cells 

For recombinant expression of one or more of the MTSP proteins, the 
nucleic acid containing all or a portion of the nucleotide sequence encoding the 
MTSP protein can be inserted into an appropriate expression vector, i.e., a 

10 vector that contains the necessary elements for the transcription and translation 
of the inserted protein coding sequence. The necessary transcriptional and 
translational signals can also be supplied by the native promoter for MTSP 
genes, and/or their flanking regions. 

Also provided are vectors that contain nucleic acid encoding the MTSPs. 

15 Cells containing the vectors are also provided. The cells include eukaryotic and 
prokaryotic cells, and the vectors are any suitable for use therein. 

Prokaryotic and eukaryotic cells, including endothelial cells, containing the 
vectors are provided. Such cells include bacterial cells, yeast cells, fungal cells, 
plant cells, insect cells and animal cells. The cells are used to produce an MTSP 

20 protein or protease domain thereof by growing the above-described cells under 
conditions whereby the encoded MTSP protein or protease domain of the MTSP 
protein is expressed by the cell, and recovering the expressed protease domain 
protein. For purposes herein, the protease domain is preferably secreted into the 
medium. 

25 In one embodiment, the vectors include a sequence of nucleotides that 

encodes a polypeptide that has protease activity and contains all or a portion of 
only the protease domain, or multiple copies thereof, of an MTSP protein are 
provided. Also provided are vectors that comprise a sequence of nucleotides 
that encodes the protease domain and additional portions of an MTSP protein up 

30 to and including a full length MTSP protein, as well as multiple copies thereof, 

are also provided. The vectors may selected for expression of the MTSP protein 
or protease domain thereof in the cell or such that the MTSP protein is 
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expressed as a transmembrane protein. Alternatively, the vectors may include 
signals necessary for secretion of encoded proteins. When the protease domain 
is expressed the nucleic acid is preferably linked to a secretion signal, such as 
the Saccharomyces cerevisiae a mating factor signal sequence or a portion 
5 thereof. 

A variety of host-vector systems may be used to express the protein 
coding sequence. These include but are not limited to mammalian cell systems 
infected with virus {e.g. vaccinia virus, adenovirus, etc.); insect cell systems 
infected with virus (e.g. bacuiovirus); microorganisms such as yeast containing 

10 yeast vectors; or bacteria transformed with bacteriophage, DNA, plasmid DNA, 
or cosmid DNA. The expression elements of vectors vary in their strengths and 
specificities. Depending on the host-vector system used, any one of a number 
of suitable transcription and translation elements may be used. 

Any methods known to those of skill in the art for the insertion of DNA 

15 fragments into a vector may be used to construct expression vectors containing 
a chimeric gene containing appropriate transcriptional/translational control signals 
and protein coding sequences. These methods may include in vitro recombinant 
DNA and synthetic techniques and in vivo recombinants (genetic recombination). 
Expression of nucleic acid sequences encoding MTSP protein, or domains, 

20 derivatives, fragments or homologs thereof, may be regulated by a second 

nucleic acid sequence so that the genes or fragments thereof are expressed in a 
host transformed with the recombinant DNA molecule(s). For example, 
expression of the proteins may be controlled by any promoter/enhancer known in 
the art. In a specific embodiment, the promoter is not native to the genes for 

25 MTSP protein. Promoters which may be used include but are not limited to the 
SV40 early promoter (Bernoist and Chambon, Nature 290 :304-310 (1981)), the 
promoter contained in the 3' long terminal repeat of Rous sarcoma virus 
(Yamamoto et al., Ceil 22:787-797 (1980)), the herpes thymidine kinase 
promoter (Wagner et al., Proc. Natl. Acad. Sci. USA 78:1441-1445 (1981)), the 

30 regulatory sequences of the metallothionein gene (Brinster et al.. Nature 296 :39- 
42 (1982)); prokaryotic expression vectors such as the ^-lactamase promoter 
(Villa-Kamaroff et al., Proc. Natl. Acad. ScL USA 75:3727-3731 1978)) or the 
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tac promoter (DeBoer et al., Proc. Natl. Acad. Sci. USA 80:21-25 (1983)); see 
also "Useful Proteins from Recombinant Bacteria": in Scientific American 
242 :79-94 (1980)); plant expression vectors containing the nopaline synthetase 
promoter (Herrar-Estrella et al., Nature 303 :209-213 (1984)) or the cauliflower 
5 mosaic virus 35S RNA promoter (Garder et ah, Nucleic Acids Res. 9:2871 

(1981)), and the promoter of the photosynthetic enzyme ribulose bisphosphate 
carboxylase (Herrera-Estrella et al., Nature 310 :1 15-120 (1984)); promoter 
elements from yeast and other fungi such as the Gal4 promoter, the alcohol 
dehydrogenase promoter, the phosphoglycerol kinase promoter, the alkaline 
10 phosphatase promoter, and the following animal transcriptional control regions 
that exhibit tissue specificity and have been used in transgenic animals: elastase 
I gene control region which is active in pancreatic acinar cells (Swift et al., Cell 
38:639-646 (1984); Ornitz et al.. Cold Spring Harbor Symp. Quant. Biol. 

m 

50:399-409 (1986); MacDonald, Hepatology 7:425-5 1 5 (1987)); insulin gene 

15 control region which is active in pancreatic beta cells (Hanahan et al.. Nature 
31 5 :1 15-122 (1985)), immunoglobulin gene control region which is active in 
.lymphoid cells (Grosschedl et al., Cell 38:647-658 (1984); Adams et al., Nature 
318:533-538 (1985); Alexander et al., Mol. Cell BioL 7:1436-1444 (1987)), 
mouse mammary tumor virus control region which is active in testicular, breast, 

20 lymphoid and mast cells (Leder et al., Cell 45:485-495 (1986)), albumin gene 
control region which is active in liver (Pinckert et al., Genes and Devel. 1:268- 
276 (1987)), alpha-fetoprotein gene control region which is active in liver 
(Krumlauf et al., Mol. Cell. BioL 5:1639-1648 (1985); Hammer et al., Science 
235 :53-58 1987)), alpha-1 antitrypsin gene control region which is active in liver 

25 (Kelsey et al.. Genes and Devel. 1:161-171 (1987)), beta globin gene control 
region which is active in myeloid cells (Mogram et al., Nature 31 5 :338-340 
(1985); Kollias et al., Cell 46:89-94 (1986)), myelin basic protein gene control 
region which is active in oligodendrocyte cells of the brain (Readhead et al., Ceil 
48:703-712 (1987)), myosin light chain-2 gene control region which is active in 

30 skeletal muscle (Sani, Nature 314 :283-286 (1985)), and gonadotrophic releasing 
hormone gene control region which is active in gonadotrophs of the 
hypothalamus (Mason et al., Science 234 :1372-1378 (1986)). 
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In a specific embodiment, a vector is used that contains a promoter 
operably linked to nucleic acids encoding an MTSP protein, or a domain, 
fragment, derivative or homolog, thereof, one or more origins of replication, and 
optionally, one or more selectable markers [e.g., an antibiotic resistance gene). 
5 Expression vectors containing the coding sequences, or portions thereof, of an 
MTSP protein, is made, for example, by subcloning the coding portions into the 
EcoRI restriction site of each of the three pGEX vectors (glutathione S- 
transferase expression vectors (Smith and Johnson, Gene 7:31-40 (1988)). This 
allows for the expression of products in the correct reading frame. Preferred 

10 vectors and systems for expression of the protease domains of the MTSP 

proteins are well known Pichia vectors (available, for example, from Invitrogen, 
San Diego, CA), particularly those designed for secretion of the encoded 
proteins. One exemplary vector is described in the EXAMPLES. 

Plasmids for transformation of E. coli cells, include, for example, the pET 

15 expression vectors (see, U.S patent 4,952,496; available from NOVAGEN, 

Madison, Wl; see, also literature published by Novagen describing the system). 
Such plasmids include pET 11a, which contains the T7lac promoter, T7 
terminator, the inducible E. coli lac operator, and the lac repressor gene; pET 
12a-c, which contains the T7 promoter, T7 terminator, and the E. coli ompT 

20 secretion signal; and pET 15b and pET1 9b (NOVAGEN, Madison, Wl), which 

contain a His-Tag™ leader sequence for use in purification with a His column and 
a thrombin cleavage site that permits cleavage following purification over the 
column; the T7-lac promoter region and the T7 terminator. 

The vectors are introduced into host cells, such as Pichia cells and 

25 bacterial cells, such as E. coli, and the proteins expressed therein. Preferred 
Pichia strains, include, for example, GS115. Preferred bacterial hosts contain 
chromosomal copies of DNA encoding T7 RNA polymerase operably linked to an 
inducible promoter, such as the lacliV promoter (see, U.S. Patent No. 
4,952,496). Such hosts include, but are not limited to, the lysogenic E. coli 

30 strain BL2KDE3). 
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Expression and production of proteins 
The MTSP domains, derivatives and analogs be produced by various 
methods known in the art. For example, once a recombinant cell expressing an 
MTSP protein, or a domain, fragment or derivative thereof, is identified, the 
5 individual gene product can be isolated and analyzed. This is achieved by 
assays based on the physical and/or functional properties of the protein, 
including, but not limited to, radioactive labeling of the product followed by 
analysis by gel electrophoresis, immunoassay, cross-linking to marker-labeled 
product. The MTSP protein proteins may be isolated and purified by standard 
10 methods known in the art (either from natural sources or recombinant host cells 
expressing the complexes or proteins), including but not restricted to column 
chromatography (e.g., ion exchange, affinity, gel exclusion, reversed-phase high 
pressure, fast protein liquid, etc.), differential centrifugation, differential 
solubility, or by any other standard technique used for the purification of 
15 proteins. Functional properties may be evaluated using any suitable assay 
known in the art. 

Alternatively, once an MTSP protein or its domain or derivative is 
identified, the amino acid sequence of the protein can be deduced from the 
nucleotide sequence of the gene which encodes it. As a result, the protein or its 
domain or derivative can be synthesized by standard chemical methods known in 
the art (e.g. see Hunkapiller et al, Nature 310 :105-1 1 1 (1984)). 

Manipulations of MTSP protein sequences may be made at the protein 
level. Also contemplated herein are MTSP protein proteins, domains thereof, 
derivatives or analogs or fragments thereof, which are differentially modified 
during or after translation, e.g., by glycosylation, acetylation, phosphorylation, 
amidation, derivatization by known protecting/blocking groups, proteolytic 
cleavage, linkage to an antibody molecule or other cellular ligand, etc. Any of 
numerous chemical modifications may be carried out by known techniques, 
including but not limited to specific chemical cleavage by cyanogen bromide, 
trypsin, chymotrypsin, papain, V8 protease, NaBH 4 , acetylation, formylation, 
oxidation, reduction, metabolic synthesis in the presence of tunicamycin, etc. 
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In addition, domains, analogs and derivatives of an MTSP protein can b 
chemically synthesized. For example, a peptide corresponding to a portion of an 
MTSP protein, which includes the desired domain or which mediates the desired 
activity in vitro can be synthesized by use of a peptide synthesizer. 
5 Furthermore, if desired, nonctassical amino acids or chemical amino acid analogs 
can be introduced as a substitution or addition into the MTSP protein sequence. 
Non-classical amino acids include but are not limited to the D-isomers of the 
common amino acids, a-amino isobutyric acid, 4-aminobutyric acid, Abu, 
2-aminobutyric acid, e-Abu, e-Ahx, 6-amino hexanoic acid, Aib, 2-amino 

10 isobutyric acid, 3-amino propionoic acid, ornithine, norleucine, norvaline, 

hydroxyproline, sarcosine, citrulline, cysteic acid, t-butylglycine, t-butylalanine, 
phenylglycine, cyclohexylalanine, ^-alanine, fluoro-amino acids, designer amino 
acids such as G-methyl amino acids, Ca-methyl amino acids, Na-methyl amino 
acids, and amino acid analogs in general. Furthermore, the amino acid can be D 

1 5 (dextrorotary) or L {levorotaryh 

In cases where natural products are suspected of being mutant or are 
isolated from new species, the amino acid sequence of the MTSP protein 
isolated from the natural source, as well as those expressed in vitro, or from 
synthesized expression vectors in vivo or in vitro, can be determined from 

20 analysis of the DNA sequence, or alternatively, by direct sequencing of the 
isolated protein. Such analysis may be performed by manual sequencing or 
through use of an automated amino acid sequenator. 

Modifications 

A variety of modification of the MTSP proteins and domains are 
25 contemplated herein. An MTSP-encoding nucleic acid molecule modified by any 
of numerous strategies known in the art (Sambrook et al., 1990, Molecular 
Cloning, A Laboratory Manual, 2d ed., Cold Spring Harbor Laboratory, Cold 
Spring Harbor, New York). The sequences can be cleaved at appropriate sites 
with restriction endonuclease(s), followed by further enzymatic modification if 
30 desired, isolated, and ligated in vitro. In the production of the gene encoding a 
domain, derivative or analog of MTSP, care should be taken to ensure that the 
modified gene retains the original translational reading frame, uninterrupted by 
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translationai stop signals, in the gene region where the desired activity is 
encoded. 

Additionally, the MTSP-encoding nucleic acid molecules can be mutated 
in vitro or in vivo, to create and/or destroy translation, initiation, and/or 
5 termination sequences, or to create variations in coding regions and/or form new 
restriction endonuclease sites or destroy pre-existing ones, to facilitate further in 
vitro modification. Also, as described herein muteins with primary sequence 
alterations, such as replacements of Cys residues and elimination of 
glycosylation sites are contemplated. Such mutations may be effected by any 

10 technique for mutagenesis known in the art, including, but not limited to, 

chemical mutagenesis and in vitro site-directed mutagenesis (Hutchinson et al., 
J. Biol. Chem. 253 :6551-6558 (1978)), use of TAB® linkers (Pharmacia). In one 
embodiment, for example, an MTSP protein or domain thereof is modified to 
include a fluorescent label. In other specific embodiments, the MTSP protein is 

15 modified to have a heterofunctional reagent, such heterofunctional reagents can 
be used to crosslink the members of the complex. 

The MTSP proteins may be isolated and purified by standard methods 
known in the art (either from natural sources or recombinant host cells 
expressing the complexes or proteins), including but not restricted to column 

20 chromatography (e.g., ion exchange, affinity, gel exclusion, reversed-phase high 
pressure, fast protein liquid, etc.), differential centrifugation, differential 
solubility, or by any other standard technique used for the purification of 
proteins. Functional properties may be evaluated using any suitable assay 
known in the art. 

25 Alternatively, once a MTSP or its domain or derivative is identified, the 

amino acid sequence of the protein can be deduced from the nucleotide 
sequence of the gene which encodes it. As a result, the protein or its domain or 
derivative can be synthesized by standard chemical methods known in the art 
(e.g., see Hunkapiller et al. Nature, 310 :105-1 1 1 (1984)). 

30 Manipulations of MTSP sequences may be made at the protein level. 

MTSP domains, derivatives or analogs or fragments, which are differentially 
modified during or after translation, e.g., by glycosylation, acetylation, 
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phosphorylation, amidation, derivatization by known protecting/blocking groups, 
proteolytic cleavage, linkage to an antibody molecule and other cellular ligand, 
are contemplated herein. Any of numerous chemical modifications may be 
carried out by known techniques, including but not limited to specific chemical 
5 cleavage by cyanogen bromide, trypsin, chymotrypsin, papain, V8 protease, 
NaBH 4 , acetylation, formylation, oxidation, reduction, metabolic synthesis in the 
presence of tunicamycin, etc. 

In addition, domains, analogs and derivatives of a MTSP can be 
chemically synthesized. For example, a peptide corresponding to a portion of a 

10 MTSP, which comprises the desired domain or which mediates the desired 
activity in vitro can be synthesized by use of a peptide synthesizer. 
Furthermore, rf desired, nonclassical amino acids or chemical amino acid analogs 
can be introduced as a substitution or addition into the MTSP sequence. Non- 
classical amino acids include but are not limited to the D-isomers of the common 

15 amino acids, a-amino isobutyric acid, 4-aminobutyric acid, Abu, 2-aminobutyric 
acid, £-Abu, e-Ahx, 6-amino hexanoic acid, Aib, 2-amino isobutyric acid, 
3-amino propionoic acid, ornithine, norleucine, norvaline, hydroxyproline, 
sarcosine, citrulline, cysteic acid, t-butylglycine, t-butylalanine, phenylglycine, 
cyclohexylalanine, ^-alanine, fluoro-amino acids, designer amino acids such as &- 

20 methyl amino acids, Ca-methyl amino acids, Na-methyl amino acids, and amino 
acid analogs in general. Furthermore, the amino acid can be D (dextrorotary) or 
L (levorotary). 

F. SCREENING METHODS 

The single chain protease domains, as shown herein, can be used in a 
25 variety of methods to identify compounds that modulate the activity thereof. For 
MTSPs that exhibit higher activity or expression in tumor cells, compounds that 
inhibit the proteolytic activity are of particular interest. For any MTSPs that are 
active at lower levels in tumor cells, compounds or agents that enhance the 
activity are potentially of interest. In all instances the identified compounds will 
30 include agents that are candidate cancer treatments. 

Several types of assays are exemplified and described herein. It is 
understood that the protease domains may be used in other assays. It is shown 
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here, however, that the single chain protease domains exhibit catalytic activity. 

As such they are ideal for in vitro screening assays. 

They may also be used in binding assays. 

The MTSP3, MTSP4 and MTSP6 full length zymogens, activated 

5 enzymes, single and double chain protease domains are contemplated for use in 

any screening assay known to those of skill in the art, including those provided 

herein. Hence the following description, if directed to proteolytic assays is 

intended to apply to use of a single chain protease domain or a catalytically 

active portion thereof of any MTSP, including an MTSP3, MTSP4 or an MTSP6. 

10 Other assays, such as binding assays are provided herein, particularly for use 

with an MTSP3, MTSP4 or MTSP6, including any variants, such as splice 

variants thereof. MTSP3 and MTSP4 are of most interest in such assays. 

1 . Catalytic Assays for identification of agents that modulate the 
protease activity of an MTSP protein 

15 , Methods for identifying a modulator of the catalytic activity of an MTSP, 

particularly a single chain protease domain or catalytically active portion thereof, 
are provided herein. The methods can be practiced by: a) contacting the MTSP, 
particularly a single-chain domain thereof, with a substrate of the MTSP in the 
presence of a test substance, and detecting the proteolysis of the substrate, 

20 whereby the activity of the MTSP is assessed, and comparing the activity to a 
control. For example, the control can be the activity of the MTSP assessed by 
contacting an MTSP, particularly a single-chain domain thereof, with a substrate 
of the MTSP, and detecting the proteolysis of the substrate, whereby the 
activity of the MTSP is assessed. The results in the presence and absence of 

25 the test compounds are compared. A difference in the activity indicates that the 
test substance modulates the activity of the MTSP. 

In one embodiment a plurality of the test substances are screened 
simultaneously in the above screening method. In another embodiment, the 
MTSP is isolated from a target cell as a means for then identifying agents that 

30 are potentially specific for the target cell. 

In still another embodiment, The test substance is a therapeutic 
compound, and whereby a difference of the MTSP activity measured in the 
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presence and in the absence of the test substance indicates that the target cell 
responds to the therapeutic compound. 

One method include the steps of (a) contacting the MTSP protein or 
protease domain thereof with one or a plurality of test compounds under 
5 conditions conducive to interaction between the ligand and the compounds; and 
(b) identifying one or more compounds in the plurality that specifically binds to 
the ligand. 

Another method provided herein includes the steps of a) contacting an 
MTSP protein or protease domain thereof with a substrate of the MTSP protein, 
10 and detecting the proteolysis of the substrate, whereby the activity of the MTSP 
protein is assessed; b) contacting the MTSP protein with a substrate of the 
MTSP protein in the presence of a test substance, and detecting the proteolysis 
of the substrate, whereby the activity of the MTSP protein is assessed; and c) 
comparing the activity of the MTSP protein assessed in steps a) and b), whereby 
15 the activity measured in step a) differs from the activity measured in step b) 
indicates that the test substance modulates the activity of the MTSP protein. 

In another embodiment, a plurality of the test substances are screened 
simultaneously. In comparing the activity of an MTSP protein in the presence 
and absence of a test substance to assess whether the test substance is a 
20 modulator of the MTSP protein, it is unnecessary to assay the activity in parallel, 
although such parallel measurement is preferred. It is possible to measure the 
activity of the MTSP protein at one time point and compare the measured 
activity to a historical value of the activity of the MTSP protein. 

For instance, one can measure the activity of the MTSP protein in the 
25 presence of a test substance and compare with historical value of the activity of 
the MTSP protein measured previously in the absence of the test substance, and 
vice versa. This can be accomplished, for example, by providing the activity of 
the MTSP protein on an insert or pamphlet provided with a kit for conducting the 
assay. 

30 Methods for selecting substrates for a particular MTSP are described in 

the EXAMPLES, and particular proteolytic assays are exemplified. 
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Combinations and kits containing the combinations optionally including 
instructions for performing the assays are provided. The combinations include 
an MTSP protein and a substrate of the MTSP protein to be assayed; and, 
optionally reagents for detecting proteolysis of the substrate. The substrates, 
5 which are typically proteins subject to proteolysis by a particular MTSP protein, 
can be identified empirically by testing the ability of the MTSP protein to cleave 
the test substrate. Substrates that are cleaved most effectively (i.e., at the 
lowest concentrations and/or fastest rate or under desirable conditions), are 
identified. 

10 Additionally provided herein is a kit containing the above-described 

combination. Preferably, the kit further includes instructions for identifying a 
modulator of the activity of an MTSP protein. Any MTSP protein is 
contemplated as target for identifying modulators of the activity thereof. 
2. Binding assays 

15 Also provided herein are methods for identification and isolation of 

agents, particularly compounds that bind to MTSPs. The assays are designed to 
identify agents that bind to the zymogen form, the single chain isolated protease 
domain (or a protein, other than an MTSP protein, that contains the protease 
domain of an MTSP protein), and to the activated form, including the activated 

20 form derived from the full length zymogen or from an extended protease domain. 
The identified compounds are candidates or leads for identification of 
compounds for treatments of tumors and other disorders and diseases involving 
aberrant angiogenesis. The MTSP proteins used in the methods include any 
MTSP protein as defined herein, and preferably use MTSP single chain domain 

25 or proteolytically active portion thereof. 

A variety of methods are provided herein. These methods may be 
performed in solution or in solid phase reactions in which the MTSP protein(s) or 
protease domain(s) thereof are linked, either directly or indirectly via a linker, to 
a solid support. Screening assays are described in the Examples, and these 

30 assays have been used to identify candidate compounds. 

For purposes herein, all binding assays described above are provided for 
MTSP3, MTSP4 and MTSP6. For MTSP1 (including any variant thereof) and 
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thereof) and other such proteases, binding assays that employ the isolated single 
chain protease domain or a protein containing such domain (other than the MTSP 
from which the protease is derived) are provided. 

Methods for identifying an agent, such as a compound, that specifically 
5 binds to an MTSP single chain protease domain or an MTSP, such as an MTSP3, 
MTSP4 or an MTSP6, are provided herein. The method can be practiced by (a) 
contacting the MTSP with one or a plurality of test agents under conditions 
conducive to binding between the MTSP and an agent; and (b) identifying one or 
more agents within the plurality that specifically binds to the MTSP. 

10 For example, in practicing such methods the MTSP polypeptide is mixed with a 
potential binding partner or an extract or fraction of a cell under conditions that 
allow the association of potential binding partners with the polypeptide. After 
mixing, peptides, polypeptides, proteins or other molecules that have become 
associated with an MTSP are separated from the mixture. The binding partner 

15 that bound to the MTSP can then be removed and further analyzed. To identify 
and isolate a binding partner, the entire protein, for instance the entire disclosed 
protein of SEQ ID Nos. 6, 8 10 or 12 can be used. Alternatively, a fragment of 
the protein can be used. 

A variety of methods can be used to obtain cell extracts. Cells can be 

20 disrupted using either physical or chemical disruption methods. Examples of 
physical disruption methods include, but are not limited to, sonication and 
mechanical shearing. Examples of chemical lysis methods include, but are not 
limited to, detergent lysis and enzyme lysis. A skilled artisan can readily adapt 
methods for preparing cellular extracts in order to obtain extracts for use in the 

25 present methods. 

Once an extract of a cell is prepared, the extract is mixed with the MTSP 
under conditions in which association of the protein with the binding partner can 
occur. A variety of conditions can be used, the most preferred being conditions 
that closely resemble conditions found in the cytoplasm of a human cell. 

30 Features such as osmolality, pH, temperature, and the concentration of cellular 
extract used, can be varied to optimize the association of the protein with the 
binding partner. 
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After mixing under appropriate conditions, the bound complex is 
separated from the mixture. A variety of techniques can be used to separate the 
mixture. For example, antibodies specific to an MTSP can be used to 
immunoprecipitate the binding partner complex. Alternatively, standard chemical 
5 separation techniques such as chromatography and density/sediment 
centrifugation can be used. 

After removing the non-associated cellular constituents in the extract, the 
binding partner can be dissociated from the complex using conventional 
methods. For example, dissociation can be accomplished by altering the salt 
10 concentration or pH of the mixture. 

To aid in separating associated binding partner pairs from the mixed 
extract, the MTSP can be immobilized on a solid support. For example, the 
protein can be attached to a nitrocellulose matrix or acrylic beads. Attachment 
of the protein or a fragment thereof to a solid support aids in separating 
15 peptide/binding partner pairs from other constituents found in the extract. The 
identified binding partners can be either a single protein or a complex made up of 
two or more proteins. 

Alternatively, the nucleic acid molecules encoding the single chain 
proteases can be used in a yeast two-hybrid system. The yeast two-hybrid 
20 system has been used to identify other protein partner pairs and can readily be 
adapted to employ the nucleic acid molecules herein described. 

Another in vitro binding assay, particularly for an MTSP3, MTSP4 or an 
MTSP6 uses a mixture of a polypeptide that contains at least the catalytic 
domain of one of these proteins and one or more candidate binding targets or 
25 substrates. After incubating the mixture under appropriate conditions, one 

determines whether the MTSP or a polypeptide fragment thereof containing the 
catalytic domain binds with the candidate substrate. For cell-free binding 
assays, one of the components includes or is coupled to a detectable label. The 
label may provide for direct detection, such as radioactivity, luminescence, 
30 optical or electron density, etc., or indirect detection such as an epitope tag, an 
enzyme, etc. A variety of methods may be employed to detect the label 
depending on the nature of the label and other assay components. For example, 
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the label may be detected bound to the solid substrate or a portion of the bound 
complex containing the label may be separated from the solid substrate, and the 
label thereafter detected. 

3. Detection of signal transduction 

5 The cell surface location of the MTSPs suggests a role for some or all of 

these proteins in signal transduction. Assays for assessing signal transduction 
are well known to those of skill in the art, and may be adapted for use with the 
MTSP protein. 

Assays for identifying agents that effect or alter signal transduction 
10 mediated by an MTSP, particularly the full length or a sufficient portion to anchor 
the extracellular domain or a function portion thereof of an MTSP on the surface 
of a cell are provided. Such assays, include, for example, transcription based 
assays in which modulation of a transduced signal is assessed by detecting an 
effect on an expression from a reporter gene (see, e.g., U.S. Patent No. 
15 5,436,128). 

4. Methods for Identifying Agents that Modulate the Expression a 
Nucleic Acid Encoding an MTSP, particularly an MTSP3, MTSP4 or 
MTSP6 

Another embodiment provides methods for identifying agents that 
20 modulate the expression of a nucleic acid encoding an MTSP, particularly an 
MTSP3, MTSP4 or MTSP. Such assays use any available means of monitoring 
for changes in the expression level of the nucleic acids encoding an MTSP, such 
as MTSP3 or MTSP4. 

In one assay format, cell lines that contain reporter gene fusions between 
25 the open reading frame of MTSP3, MTSP4 or MTSP6 or a domain thereof, 
particularly the protease domain and any assayable fusion partner may be 
prepared. Numerous assayable fusion partners are known and readily available 
including the firefly luciferase gene and the gene encoding chloramphenicol 
acetyltransferase (Alam et aL, Anal. Biochem. 188: 245-54 (1990)). Cell lines 
30 containing the reporter gene fusions are then exposed to the agent to be tested 
under appropriate conditions and time. Differential expression of the reporter 
gene between samples exposed to the agent and control samples identifies 
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agents which modulate the expression of a nucleic acid encoding an MTSP3, 
MTSP4 or MTSP6. 

Additional assay formats may be used to monitor the ability of the agent 
to modulate the expression of a nucleic acid encoding an MTSP3, MTSP4 or 
5 MTSP6. For instance, mRNA expression may be monitored directly by 

hybridization to the nucleic acids. Cell lines are exposed to the agent to be 
tested under appropriate conditions and time and total RNA or mRNA is isolated 
by standard procedures (see, e.g., Sambrook et all (1989) MOLECULAR 
CLONING: A LABORATORY MANUAL, 2nd Ed. Cold Spring Harbor Laboratory 

10 Press). Probes to detect differences in RNA expression levels between cells 

exposed to the agent and control cells may be prepared from the nucleic acids. 
It is preferable, but not necessary, to design probes which hybridize only with 
target nucleic acids under conditions of high stringency. Only highly 
complementary nucleic acid hybrids form under conditions of high stringency. 

15 Accordingly, the stringency of the assay conditions determines the amount of 

complementarity which should exist between two nucleic acid strands in order to 
form a hybrid. Stringency should be chosen to maximize the difference in 
stability between the probe:target hybrid and potential probe:non-target hybrids. 
Probes may be designed from the nucleic acids through methods known 

20 in the art. For instance, the G-f C content of the probe and the probe length can 
affect probe binding to its target sequence. Methods to optimize probe 
specificity are commonly available (see, e.g., Sambrook et aL (1989) 
MOLECULAR CLONING: A LABORATORY MANUAL, 2nd Ed. Cold Spring 
Harbor Laboratory Press); and Ausubel et aL (1995) CURRENT PROTOCOLS IN 

25 MOLECULAR BIOLOGY, Greene Publishing Co., NY). 

Hybridization conditions are modified using known methods (see, e.g., 
Sambrook et aL (1989) MOLECULAR CLONING: A LABORATORY MANUAL, 
2nd Ed. Cold Spring Harbor Laboratory Press); and Ausubel et aL (1995) 
CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, Greene Publishing Co., NY), 

30 as required for each probe. Hybridization of total cellular RNA or RNA enriched 
for polyA RNA can be accomplished in any available format. For instance, total 
cellular RNA or RNA enriched for polyA RNA can be affixed to a solid support, 
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and the solid support exposed to at least one probe .comprising at least one, or 

part of one of the nucleic acid molecules under conditions in which the probe 

will specifically hybridize. Alternatively, nucleic acid fragments comprising at 

least one, or part of one of the sequences can be affixed to a solid support, such 

5 as a porous glass wafer. The glass wafer can then be exposed to total cellular 

RNA or polyA RNA from a sample under conditions in which the affixed 

sequences will specifically hybridize. Such glass wafers and hybridization 

methods are widely available, for example, those disclosed by Beattie (WO 

95/1 1755). By examining for the ability of a given probe to specifically hybridize 

10 to an RNA sample from an untreated cell population and from a cell population 

exposed to the agent, agents which up or down regulate the expression of a 

nucleic acid encoding the protein having the sequence of SEQ ID NO:3 or SEQ ID 

NO:4 are identified. 

5. Methods for Identifying Agents that Modulate at Least One 
15 Activity of an MTPS, such as MTSP3, MTSP4 or MTSP6 

Methods for identifying agents that modulate at least one activity of a an 

MTSP, such as an MTSP3, MTSP4 or MTSP6 are provided. Such methods or 

assays may use any means of monitoring or detecting the desired activity. 

In one format, the relative amounts of a protein between a cell population 

20 that has been exposed to the agent to be tested compared to an un-exposed 
control cell population may be assayed (e.g., a prostate cancer cell line, a lung 
cancer cell line, a colon cancer cell line or a breast cancer cell line). In this 
format, probes, such as specific antibodies, are used to monitor the differential 
expression of the protein in the different cell populations. Cell lines or 

25 populations are exposed to the agent to be tested under appropriate conditions 
and time. Cellular lysates may be prepared from the exposed cell line or 
population and a control, unexposed cell line or population. The cellular lysates 
are then analyzed with the probe. 

For example, N- and C- terminal fragments of the MTSP can be expressed 

30 in bacteria and used to search for proteins which bind to these fragments. 

Fusion proteins, such as His-tag or GST fusion to the N- or C-terminal regions of 
the MTSP, such as an MTSP3', MTSP4 or an MTSP6, can be prepared for use as 
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a substrate. These fusion proteins can be coupled to, for example, Glutathione- 
Sepharose beads and then probed with cell lysates. Prior to lysis, the cells may 
be treated with a candidate agent which may modulate an MTSP, such as an 
MTSP3, MTSP4 or an MTSP6, or proteins that interact with domains thereon. 
5 Lysate proteins binding to the fusion proteins can be resolved by SDS-PAGE, 
isolated and identified by protein sequencing or mass spectroscopy, as is known 
in the art. 

Antibody probes are prepared by immunizing suitable mammalian hosts in 
appropriate immunization protocols using the peptides, polypeptides or proteins 
10 if they are of sufficient length (e.g., 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 
25, 30, 35, 40 or more consecutive amino acids the MTSP protein, such as an 
MTSP3, an MTSP4 or an MTSP6), or if required to enhance immunogenicity, 
conjugated to suitable carriers. Methods for preparing immunogenic conjugates 
with carriers, such as bovine serum albumin (BSA), keyhole limpet hemocyanin 

15 (KLH), or other carrier proteins are well known in the art. In some 

circumstances, direct conjugation using, for example, carbodiimide reagents may 
be effective; in other instances linking reagents such as those supplied by Pierce 
Chemical Co., Rockford, IL, may be desirable to provide accessibility to the 
hapten. Hapten peptides can be extended at either the amino or carboxy 

20 terminus with a Cys residue or interspersed with cysteine residues, for example, 
to facilitate linking to a carrier. Administration of the immunogens is conducted 
generally by injection over a suitable time period and with use of suitable 
adjuvants, as is generally understood in the art. During the immunization 
schedule, titers of antibodies are taken to determine adequacy of antibody 

25 formation. 

Anti-peptide antibodies can be generated using synthetic peptides 
corresponding to, for example, the carboxy terminal amino acids of the MTSP. 
Synthetic peptides can be as small as 1-3 amino acids in length, but are 
preferably at least 4 or more amino acid residues long. The peptides can be 
30 coupled to KLH using standard methods and can be immunized into animals, 
such as rabbits or ungulate. Polyclonal antibodies can then be purified, for 
example using Actigel beads containing the covalently bound peptide. 
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While the polyclonal antisera produced in this way may be satisfactory for 
some applications, for pharmaceutical compositions, use of monoclonal 
preparations is preferred. Immortalized cell lines which secrete the desired 
monoclonal antibodies may be prepared using the standard method of Kohler et 
5 a/., {Nature 256: 495-7 (1975)) or modifications which effect immortalization of 
lymphocytes or spleen cells, as is generally known. The immortalized cell lines 
secreting the desired antibodies are screened by immunoassay in which the 
antigen is the peptide hapten, polypeptide or protein. When the appropriate 
Immortalized cell culture secreting the desired antibody is identified, the cells can 

10 be cultured either in vitro or by production in vivo via ascites fluid. Of particular 
interest, are monoclonal antibodies that recognize the catalytic domain of an 
MTSP, such as an MTSP3, MTSP4 or an MTSP6. 

Additionally, the zymogen or two-chain forms the MTSP can be used to 
make monoclonal antibodies which recognize conformation epitopes. For 

15 peptide-directed monoclonal antibodies, peptides from the C1r/C1s domain can 
be used to generate anti-C1r/C1s domain monoclonal antibodies which can 
thereby block activation of the zymogen to the two-chain form of the MTSP. 
This domain can similarly be the substrate for other non-antibody compounds 
which bind to these preferred domains on either the single-chain or double-chain 

20 forms of the MTSP3, MTSP4 or MTSP6, and thereby modulate the activity of 
thereof or prevent its activation. 

The desired monoclonal antibodies are then recovered from the culture 
supernatant or from the ascites supernatant. Fragments of the monoclonals or 
the polyclonal antisera which contain the immunologically significant portion can 

25 be used as antagonists, as well as the intact antibodies. Use of immunologically 
reactive fragments, such as the Fab, Fab', of F(ab') 2 fragments are often 
preferable, especially in a therapeutic context, as these fragments are generally 
less immunogenic than the whole immunoglobulin. 

The antibodies or fragments may also be produced. Regions that bind 

30 specifically to the desired regions of receptor can also be produced in the 
context of chimeras with multiple species origin. 
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Agents that are assayed in the above method can be randomly selected 
or rationally selected or designed. 

The agents can be, as examples, peptides, small molecules, and 
carbohydrates. A skilled artisan can readily recognize that there is no limit as to 
5 the structural nature of the agents. 

The peptide agents can be prepared using standard solid phase (or 
solution phase) peptide synthesis methods, as is known in the art. In addition, 
the DNA encoding these peptides may be synthesized using commercially 
available oligonucleotide synthesis instrumentation and produced recombinantly 
10 using standard recombinant production systems. The production using solid 

phase peptide synthesis is necessitated if non-gene-encoded amino acids are to 
be included. 

G. Assay formats and selection of test substances 

A variety of formats and detection protocols are known for performing 
15 screening assays. Any such formats and protocols may be adapted for 

identifying modulators of MTSP protein activities. The following includes a 
discussion of exemplary protocols. 

1 . High throughput screening assays 

Although the above-described assay can be conducted where a single 
20 MTSP protein is screened, and/or a single test substance is screened for in one 
assay, the assay is preferably conducted in a high throughput screening mode, 
Le., a plurality of the MTSP proteins are screened against and/or a plurality of 
the test substances are screened for simultaneously (See generally, High 
Throughput Screening: The Discovery of Bioactive Substances (Devlin, Ed.) 
25 Marcel Dekker, 1997; Sittampalam et al., Curr. Opln. Chem. Biol., 1(31:384-91 
(1997); and Silverman et aL, Curr. Opln. Chem. Biol., 2(31:397-403 (1998)). For 
example, the assay can be conducted in a multi-well (e.g., 24-, 48-, 96-, or 384- 
well), chip or array format. 

High-throughput screening (HTS) is the process of testing a large number 
30 of diverse chemical structures against disease targets to identify "hits" 

(Sittampalam et al., Curr. Opin. Chem. Biol., 1(3) :384-91 (1997)). Current 
state-of-the-art HTS operations are highly automated and computerized to handle 
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sample preparation, assay procedures and the subsequent processing of large 
volumes of data. 

Detection technologies employed in high-throughput screens depend on 
the type of biochemical pathway being investigated (Sittampalam et al. # Curr. 
5 Opin. Chem. BioL, 1 (3) :384-91 (1997)). These methods include, radiochemical 
methods, such as the scintillation proximity assays (SPA), which can be adapted 
to a variety of enzyme assays (Lerner et al., J. Biomol. Screening, J_: 135-143 
(1996); Baker et al.. Anal. Biochem., 239 :20-24 (1996); Baum et al., Anal. 
Biochem., 237 :129-134 (1996); and Sullivan et al., J. Biomol. Screening, 2:19- 

10 23 (1997)) and protein-protein interaction assays (Braunwalder et al., J. Biomol. 
Screening, 1:23-26 (1996); Sonatore et al.. Anal. Biochem., 240 :289-297 
(1996); and Chen et al., J. Biol. Chem., 271 :25308-25315 (1996)), and non- 
isotopic detection methods, including but are not limited to, colorimetric and 
luminescence detection methods, resonance energy transfer (RET) methods, 

15 time-resolved fluorescence (HTRF) methods, cell-based fluorescence assays, 
such as fluorescence resonance energy transfer (FRET) procedures (see, 
e.g., Gonzalez et al., Biophys. J., 69:1272-1280 (1995)), fluorescence 
polarization or anisotropy methods (see, e.g., Jameson et al., Methods EnzymoL, 
246 :283-300 (1995); Jolley, J. Biomol. Screening, 1:33-38 (1996); Lynch et al., 

20 Anal. Biochem., 247 :77-82 (1997)), fluorescence correlation spectroscopy (FCS) 
and other such methods. 

2. Test Substances 
Test compounds, including small molecules and libraries and collections 
thereof can be screened in the above-described assays and assays described 

25 below to identify compounds that modulate the activity an MTSP protein. 

Rational drug design methodologies that rely on computational chemistry may be 
used to screen and identify candidate compounds. 

The compounds identified by the screening methods include inhibitors, 
including antagonists, and may be agonists Compounds for screening are any 

30 compounds and collections of compounds available, know or that can be 
prepared. 
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a. Selection of Compounds 

Compounds can be selected for their potency and selectivity of inhibition 
of serine proteases, especially MTSP protein. As described herein, and as 
generally known, a target serine protease and its substrate are combined under 
5 assay conditions permitting reaction of the protease with its substrate. The 
assay is performed in the absence of test compound, and in the presence of 
increasing concentrations of the test compound. The concentration of test 
compound at which 50% of the serine protease activity is inhibited by the test 
compound is the IC 50 value (Inhibitory Concentration) or EC 50 {Effective 

10 Concentration) value for that compound. Within a series or group of test 

compounds, those having lower IC 50 or EC 50 values are considered more potent 
inhibitors of the serine protease than those compounds having higher IC 50 or 
EC 50 values. The IC 50 measurement is often used for more simplistic assays, 
whereas the EC 50 is often used for more complicated assays, such as those 

15 employing cells. 

Preferred compounds according to this aspect have an IC 50 value of 100 
nM or less as measured in an in vitro assay for inhibition of MTSP protein 
activity. Especially preferred compounds have an IC 50 value of less than 100 
nM. 

20 The test compounds also are evaluated for selectivity toward a serine 

protease. As described herein, and as generally known, a test compound is 
assayed for its potency toward a panel of serine proteases and other enzymes 
and an IC 50 value or EC 60 value is determined for each test compound in each 
assay system. A compound that demonstrates a low IC S0 value or EC 50 value for 

25 the target enzyme, e.g., MTSP protein, and a higher IC 50 value or EC 50 value for 
other enzymes within the test panel (e.g., urokinase tissue plasminogen 
activator, thrombin, Factor Xa), is considered to be selective toward the target 
enzyme. Generally, a compound is deemed selective if its IC 50 value or EC^ 
value in the target enzyme assay is at least one order of magnitude less than the 

30 next smallest IC^ value or EC 50 value measured in the selectivity panel of 
enzymes. 
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Presently preferred compounds have an IC 50 value of 100 nM or less as 
measured in an in vitro assay for inhibition of urokinase activity. Especially 
preferred compounds have an IC S0 value in the in vitro urokinase inhibition assay 
that is at least one order of magnitude smaller than the IC 50 value measured in 
5 the in vitro tPA inhibition assay. Compounds having a selectivity ratio of IC e „ u- 
PA assay: IC S0 MTSP protein assay of greater than 100 are especially preferred. 

Compounds are also evaluated for their activity in vivo. The type of 
assay chosen for evaluation of test compounds will depend on the pathological 
condition to be treated or prevented by use of the compound, as well as the 
10 route of administration to be evaluated for the test compound. 

For instance, to evaluate the activity of a compound to reduce tumor 
growth through inhibition of MTSP protein, the procedures described by Jankun 
et al.. Cane. Fes., 57:559-563 (1997) to evaluate PAI-1 can be employed. 
Briefly, the ATCC cell lines DU145 and LnCaP are injected into SCID mice. After 
1 5 tumors are established, the mice are given test compound according to a dosing 
regime determined from the compound's in vitro characteristics. The Jankun et 
al. compound was administered in water. Tumor volume measurements are 
taken twice a week for about five weeks. A compound is deemed active if an 
animal to which the compound was administered exhibited decreased tumor 
!0 volume, as compared to animals receiving appropriate control compounds. 

Another in vivo experimental model designed to evaluate the effect of p- 
aminobenzamidine, a swine protease inhibitor, on reducing tumor volume is 
described by Billstrom et al.. Int. J. Cancer, 61:542-547 (1995). 

To evaluate the ability of a compound to reduce the occurrence of, or 
inhibit, metastasis, the procedures described by Kobayashi et al., Int. J. Cane, 
57:727-733d (1994) can be employed. Briefly, a murein xenograft selected for 
high lung colonization potential in injected into C57B1/6 mice i.v. (experimental 
metastasis) or s.c. into the abdominal wall (spontaneous metastasis). Various 
concentrations of the compound to be tested can be admixed with the tumor 
cells in Matrigel prior to injection. Daily i.p. injections of the test compound are 
made either on days 1-6 or days 7-13 after tumor inoculation. The animals are 
sacrificed about three or four weeks after tumor inoculation, and the lung tumor 
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colonies are counted. Evaluation of the resulting data permits a determination as 
to efficacy of the test compound, optima! dosing and route of administration. 

The activity of the tested compounds toward decreasing tumor volume 
and metastasis can be evaluated in model described in Rabbani et al., Int. J. 
5 Cancer 63:840-845 (1 995) to evaluate their inhibitor. There, Mat LyLu tumor 
cells were injected into the flank of Copenhagen rats. The animals were 
implanted with osmotic minipumps to continuously administer various doses of 
test compound for up to three weeks. The tumor mass and volume of 
experimental and control animals were evaluated during the experiment, as were 

10 metastatic growths. Evaluation of the resulting data permits a determination as 
to efficacy of the test compound, optimal dosing, and route of administration. 
Some of these authors described a related protocol in Xing et al., Cane. Res. t 
57:3585-3593 (1997). 

To evaluate the inhibitory activity of a compound, a rabbit cornea 

15 neovascularization model can be employed. Avery et a\.,Arch* OphthalmoL, 
108 :1474-1475 (1990) describe anesthetizing New Zealand albino rabbits and 
then making a central corneal incision and forming a radial corneal pocket. A 
slow release prostaglandin pellet was placed in the pocket to induce 
neovascularization. Test compound was administered i.p, for five days, at which 

20 time the animals were sacrificed. The effect of the test compound is evaluated 
by review of periodic photographs taken of the limbus, which can be used to 
calculate the area of neovascular response and, therefore, limbal 
neovascularization. A decreased area of neovascularization as compared with 
appropriate controls indicates the test compound was effective at decreasing or 

25 inhibiting neovascularization. 

An angiogenesis model used to evaluate the effect of a test compound in 
preventing angiogenesis is described by Min et al.. Cane. Res., 56:2428-2433 
(1996). C57BL6 mice receive subcutaneous injections of a Matrigel mixture 
containing bFGF, as the angiogenesis-inducing agent, with and without the test 

30 compound. After five days, the animals are sacrificed and the Matrigel plugs, in 
which neovascularization can be visualized, are photographed. An experimental 
animal receiving Matrigel and an effective dose of test compound will exhibit 
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less vascularization than a control animal or an experimental animal receiving a 
less- or non-effective does of compound. 

An in vivo system designed to test compounds for their ability to limit the 
spread of primary tumors is described by Crowley et al., Proc. Natl. Acad. Sci., 
5 90:5021-5025 (1993). Nude mice are injected with tumor cells (PC3) 

engineered to express CAT (chloramphenicol acetyltransferase). Compounds to 
be tested for their ability to decrease tumor size and/or metastases are 
administered to the animals, and subsequent measurements of tumor size and/or 
metastatic growths are made. In addition, the level of CAT detected in various 
10 organs provides an indication of the ability of the test compound to inhibit 

metastasis; detection of less CAT in tissues of a treated animal versus a control 
animal indicates less CAT-expressing cells migrated to that tissue. 

In vivo experimental modes designed to evaluate the inhibitory potential 
of a test serine protease inhibitors, using a tumor cell line F3II, the to be highly 
15 invasive, are described by Alonso et al., Breast Cane. Res. Treat., 40:209-223 
(1996). This group describes in vivo studies for toxicity determination, tumor 
growth, invasiveness, spontaneous metastasis, experimental lung metastasis, 
and an angiogenesis assay. 

The CAM model (chick embryo chorioallantoic membrane model), first 
20 described by L. Ossowski in 1998 (J. Cell Biol., 107 :2437-2445 (1988)), 

provides another method for evaluating the urokinase inhibitory activity of a test 
compound. In the CAM model, tumor cells invade through the chorioallantoic 
membrane containing CAM with tumor cells in the presence of several serine 
protease inhibitors results in less or no invasion of the tumor cells through the 
25 membrane. Thus, the CAM assay is performed with CAM and tumor cells in the 
presence and absence of various concentrations of test compound. The 
invasiveness of tumor cells is measured under such conditions to provide an 
indication of the compound's inhibitory activity. A compound having inhibitory 
activity correlates with less tumor invasion. 
30 The CAM model is also used in a standard assay of angiogenesis (i.e., 

effect on formation of new blood vessels (Brooks et al., Methods in Molecular 
Biology, 129:257-269 (1999)). According to this model, a filter disc containing 
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an angiogenesis inducer, such as basic fibroblast growth factor (bFDG) is placed 
onto the CAM. Diffusion of the cytokine into the CAM induces local 
angiogenesis, which may be measured in several ways such as by counting the 
number of blood vessel branch points within the CAM directly below the filter 
5 disc. The ability of identified compounds to inhibit cytokine-induced 

angiogenesis can be tested using this model. A test compound can either be 
added to the filter disc that contains the angiogenesis inducer, be placed directly 
on the membrane or be administered systemically. The extent of new blood 
vessel formation in the presence and/or absence of test compound can be 

10 compared using this model. The formation of fewer new blood vessels in the 
presence of a test compound would be indicative of anti-angiogenesis activity. 
Demonstration of anti-angiogenesis activity for inhibitors of an MTSP protein 
indicates a role in angiogenesis for that MTSP protein. 

b. Known serine protease inhibitors 

15 Compounds for screening can be serine protease inhibitors, which can be 

tested for their ability to inhibit the activity of an MTSP, particularly an MTSP3, 
MTSP4, or MTSP6. 

Exemplary, but not limiting serine proteases, include the following known 
serine protease inhibitors are used in the screening assays: Serine Protease 

20 Inhibitor 3 (SPI-3) (Chen, M.C., et al., Citokine, UiUi:856-862 (1999)); 

Aprotinin (lijima, R., et al., J. Biochem. (Tokyo), 126(51 :912-916 (1999)); Kazal- 
type serine protease inhibitor-like proteins (Niimi, T., et al., Eur. J. Biochem., 
266(11:282-292 (1999)); Kunitz-type serine protease inhibitor (Ravichandran, S., 
et al.. Acta Crystailogr. D. Biol. Crystallogr. , 55(1 1 1 :1814-1 821 (1999)); Tissue 

25 factor pathway inhibitor-2/Matrix-associated serine rotease inhibitor (TFPI- 
2/MSPI), (Liu, Y., et al.. Arch, Biochem. Biophys., 370111:1 12-8 (1999)); 
Bukunin, (Cui, C.Y., et al., J. invest. Dermatol., 1 13(2) :182-8 (1999)); 
Nafmostat mesilate (Ryo, R., et al.. Vox Sang., 76(41 :241-6 (1999)); TPCK 
(Huang, Y. f et al.. Oncogene, 1 8(231 :3431-9 (1999)); A synthetic cotton-bound 

30 serine protease inhibitor (Edwards, J.V., et al., Wound Repair Regen., 7(2) :106- 
18 (1999)); FUT-175 (Sawada, M., et al., Stroke e 30(3) :644-50 (1999)); 
Combination of serine protease inhibitor FUT-0175 and thromboxane synthetase 
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inhibitor OKY-046 (Kaminogo, M., et al., Neurol. Med. Chir. (Tokyo), 
38]JJl:704-8; discussion 708-9 (1998)); The rat serine protease inhibitor 2.1 
gene (LeCam, A., et al., Biochem. Biophys. Res. Commun., 253(2) :31 1-4 
(1998)); A new intracellular serine protease inhibitor expressed in the rat 
5 pituitary gland complexes with granzyme B (Hill, R.M., et al., FEBS Lett., 

440(31:361-4 (1998)); 3,4-Dichloroisocoumarin (Hammed, A., et al., Proc. Soc. 
Exp. Biol. Med., 219(2) :132-7 (1998)); LEX032 (Bains, A.S., et al., Eur. J. 
Pharmacol.. 356(1 ) :67-72 (1998)); N-tosyl-L-phenylalanine chloromethyl ketone 
(Dryjanski, M., et al., Biochemistry, 37(40) :141 51 -6 (1998)); Mouse gene for 

10 the serine protease inhibitor neuroserpin (P1 12) (Berger, P., et al., Gene, 214(1- 
21:25-33 (1998)); Rat serine protease inhibitor 2.3 gene (Paul, C, et al., Eur. J. 
Biochem., 254(3) :538-46 (1998)); Ecotin (Yang, S.Q., et al., J. Mol. Biol., 
279141:945-57 (1998)); A 14 kDa plant-related serine protease inhibitor (Roch, 
P., et al.. Dev. Comp. Immunol., 22(1 ) :1-1 2 (1998)); Matrix-associated serine 

15 protease inhibitor TFPI-2/33 kDa MSPI (Rao, C.N., et a!., int. J. Cancer, 

76(51:749-56 (1998)); ONO-3403 (Hiwasa, T., et al., Cancer Lett., 126(2) :221- 
5 (1998)); Bdellastasin (Moser, M., et al., Eur. J. Biochem., 253(1 ) :21 2-20 
(1998)); Bikunin (Xu, Y., et al., J. Mol. Biol., 276(5) :955-66 (1998)); 
Nafamostat mesilate (Mellgren, K., et al., Thromb. Haemost., 79(21 :342-7 

20 (1998)); The growth hormone dependent serine protease inhibitor, Spi 2.1 
(Maake, C, et al., Endocrinology, 138(12) :5630-6 (1997)); Growth factor 
activator inhibitor type 2, a Kunitz-type serine protease inhibitor (Kawaguchi, T., 
et al., J. Biol. Chem., 272(441 :27558-64 (1997)); Heat-stable serine protease 
inhibitor protein from ovaries of the desert locust, Schistocerga gregaria 

25 (Hamdaoui, A., et al., Biochem. Biophys. Res. Commun., 238(2) :357-60 
(1997)); Bikunin, (Delaria, K.A., et al., J. Biol. Chem., 272(1 8) :1 2209-14 
(1997)); Human placental bikunin (Marlor, C.W., et al., J. Biol. Chem., 
272(101 :12202-8 (1997)); Hepatocyte growth factor activator inhibitor, a novel 
Kunitz-type serine protease inhibitor (Shimomura, T., et al., J. Biol. Chem., 

30 2721101:6370-6 (1997)); FUT-187, oral serine protease inhibitor, (Shiozaki, H., 
et al., Gan To Kaguku Ryoho, 23(14) : 1971-9 (1996)); Extracellular matrix- 
associated serine protease inhibitors (Mr 33,000, 31,000, and 27,000 (Rao, 
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C. N., et al., Arch. Biochem. Biophys., 335(1 ) :82-92 (1996)); An irreversible 
isocoumarin serine protease inhibitor (Palencia, D.D., et al., Biol. Reprod., 
55(3) :536-42 (1 996)); 4-(2-aminoethyl)-benzenesulfonyl fluoride (AEBSF) 
(Nakabo, Y., et al., J. Leukoc. Biol. , 60(3) :328-36 (1996)); Neuroserpin 

5 (Osterwalder, T., et al., EMBO J., 1 5(1 21 :2944-53 (1996)); Human serine 
protease inhibitor alpha- 1 -antitrypsin (Forney, J.R., et al., J. Parasitol.. 
82(3) :496-502 (1996)); Rat serine protease inhibitor 2.3 (Simar-Blanchet, A.E., 
et ah, Eur. J. Biochem., 236(2) :638-48 (1996)); Gebaxate mesilate (parodi, F., 
et al., J. Cardiothorac. Vase. Anesth., 10(2) :235-7 (1996)); Recombinant serine 
10 protease inhibitor, CPTI II (Stankiewicz, M., et al., (Acta Biochim. Pol., 

43(31:525-9 (1996)); A cysteine-rich serine protease inhibitor (Guamerin II) (Kim, 

D. R., et al., J. Enzym. Inhib., 10(2) :81-91 (1996)); Diisopropylfluorophosphate 
(Lundqvist, H., et al., Inflamm. Res., 44(12) :510-7 (1995)); Nexin 1 (Yu, D.W., 
et al., J. Ce/ISci., 108(Pt 12) :3867-74 (1995)); LEX032 (Scalia, R„ et al., 

15 Shock, 4(4) :251-6 (1995)); Protease nexin I (Houenou, L.J., et al., Proc. Natl. 
Acad. Sci. U.S.A.. 92(3) :895-9 (1995)); Chyrnase-directed serine protease 
inhibitor (Woodard S.L., et al., J. Immunol., 1 53(1 1 ) :501 6-25 (1994)); N-alpha- 
tosyl-L-lysyl-chloromethyl ketone (TLCK) (Bourinbaiar, A.S., et al., Cell Immunol., 
155111:230-6 (1994)); Smpi56 (Ghendler, Y., et al., Exp. Parasitol., 78(2) :121- 

20 31 (1994)); Schistosoma haematobium serine protease (Blanton, R.E., et al., 
Mol. Biochem. Parasitol., 63(11 :1-1 1 (1994)); Spi-1 (Warren, W.C., et al., Mol. 
Cell Endocrinol. f 98(11:27-32 (1993)); TAME (Jessop, J.J., et al., Inflammation, 
17(51:613-31 (1993)); Antithrombin III (Kalaria, R.N., et al.. Am. J. Pathol., 
143(31:886-93 (1993)); FOY-305 (Ohkoshi, M„ et al., Anticancer Res. f 

25 13(4) :963-6 (1993)); Camostat mesilate (Senda, S., et al., Intern. Med., 

32(41:350-4 (1993)); Pigment epithelium-derived factor (Steele, F.R., et al., 
Proc. Natl. Acad. ScL U.S.A., 90(41:1526-30 (1993)); Antistasin (Holstein, 
T.W., et al., FEBS Lett., 309(31:288-92 (1992)); The vaccinia virus K2L gene 
encodes a serine protease inhibitor (Zhou, J., et al., Virology, 189(2) :678-86 

30 (1992)); Bowman-Birk serine-protease inhibitor (Werner, M.H., et al., J. Mol. 
Biol., 225(31:873-89 (1992); FUT-175 (Yanamoto, H., et al., Neurosurgery, 
30(31:358-63 (1992)); FUT-175; (Yanamoto, H., et al„ Neurosurgery, 
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30(3) :351-6, discussion 356-7 (1992)); PAI-I (Yreadwell, B.V., et al. r J. Orthop. 
Res., 9(3) ;309-1 6 (1991)); 3,4-Dichloroisocoumarin (Rusbridge, N.M., et al. f 
FEBS Lett., 268(11 :133-6 (1990)); Alpha 1 -antichymotrypsin (Lindmark, B.E., et 
al., Am. Rev. Resp/r. Pes.. 141(4 Pt 1) :884-8 (1990)); P-toluenesulfonyl-L- 
5 arginine methyl ester (TAME) (Scuderi, P., J. Immunol., 143(1 ) :1 68-73 (1989)); 
Aprotinin (Seto, S., et al., Adv. Exp. Med. Biol., 247B:49-54 (1989)); Alpha 1- 
antichymotrypsin (Abraham, C.R., et al., Cell, 52(4) :487-501 (1988)); 
Contrapsin (Modha, J., et al., Parasitology, 96 (Pt 1) :99-109 (1988)); (FOY-305) 
(Yamauchi, Y., et al., Hiroshima J. Med. Sci. f 36(1 1 :81-7 No abstract available 

10 (1987)); Alpha 2-antiplasmin (Holmes, W.E., et al. # J. Biol. Chem., 262(4) :1659- 
64 (1987)); 3,4-dichloroisocoumarin (Harper, J.W., et al., Biochemistry, 
24(8) :1831-41 (1985)); Diisoprophylfluorophosphate (Tsutsui, K„ et al., 
Biochem. Biophys. Res. Commun., 1 23(1 ) :271-7 (1984)); Gabexate mesilate 
(Hesse, B., et al., Pharmacol. Res. Commun., 1 6(7) :637-45 (1984)); Phenyl 

15 methyl sulfonyl fluoride (Dufer, J., et al., Scand. J. Haematol. , 32(1 ) :25-32 

(1984)); Aprotinin (Seto, S., et al., Hypertension, 5(6) :893-9 (1983)); Protease 
inhibitor CI-2 (McPhalen, C.A., et al., J. MoL Biol., 168(21 :445-7 (1983)); 
Phenylmethylsulfonyl fluoride (Sekar V., et al., Biochem. Biophys. Res. 
Commun., 89(2) :474-8 (1979)); PGE1 (Feinstein, M.D., et al., Prostaglandine, 

20 14(61:1075-93(1977) 

c. Combinatorial libraries and other libraries 
The source of compounds for the screening assays, can be libraries, 
including, but are not limited to, combinatorial libraries. Methods for 
synthesizing combinatorial libraries and characteristics of such combinatorial 

25 libraries are known in the art (See generally, Combinatorial Libraries: Synthesis, 
Screening and Application Potential (Cortese Ed.) Walter de Gruyter, Inc., 1995; 
Tietze and Lieb, Curr. Opin. Chem. Biol., 2(3) :363-71 (1998); Lam, Anticancer 
Drug Des., 1 2(3) :1 45-67 (1997); Blaney and Martin, Curr. Opin. Chem. Biol., 
1(11:54-9 (1997); and Schultz and Schultz, BiotechnoL Prog., 12(6) :729-43 

30 (1996)). 

Methods and strategies for generating diverse libraries, primarily peptide- 
and nucleotide-based oligomer libraries, have been developed using molecular 
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biology methods and/or simultaneous chemical synthesis methodologies (see, 
e.g., Dower et al., Anna. Pep. Med. Chem., 26:271-280 (1991); Fodor et al., 
Science, 251 :767-773 (1991); Jung et al., Angew. Chem. Ind. Ed. EngL, 
31:367-383 (1992); Zuckerman et al., Proc. Natl. Acad. ScL USA, 89:4505- 
5 4509 (1992); Scott et al.. Science, 249 :386-390 (1990); Devlin et al., Science, 
249 :404-406 (1990); Cwirla et al., Proc. Natl. Acad. Sci. USA, 87:6378-6382 

(1990) ; and Gallop et al., J. Medicinal Chemistry, 37:1233-1251 (1994)). The 
resulting combinatorial libraries potentially contain millions of compounds and 
that can be screened to identify compounds that exhibit a selected activity. 

10 The libraries fall into roughly three categories: fusion-protein-displayed 

peptide libraries in which random peptides or proteins are presented on the 
surface of phage particles or proteins expressed from plasmids; support-bound 
synthetic chemical libraries in which individual compounds or mixtures of 
compounds are presented on insoluble matrices, such as resin beads (see, e.g., 

15 Lam et al., Nature, 354 :82-84 (1991)) and cotton supports (see, e.g., Eichler et 
al., Biochemistry 32:1 1035-11 041 (1993)); and methods in which the 
compounds are used in solution (see, e.g., Houghten et al., Nature, 354 :84-86 

(1991) ; Houghten et al., BioTechniques, 313 :412-421 (1992); and Scott et al., 
Curr. Opin. Bi o techno!. , 5.:40-48 (1994)). There are numerous examples of 

20 synthetic peptide and oligonucleotide combinatorial libraries and there are many 
methods for producing libraries that contain non-peptidic small organic mole- 
cules. Such libraries can be based on basis set of monomers that are combined 
to form mixtures of diverse organic molecules or that can be combined to form a 
library based upon a selected pharmacophore monomer. 

25 Either a random or a deterministic combinatorial library can be screened 

by the presently disclosed and/or claimed screening methods. In either of these 
two libraries, each unit of the library is isolated and/or immobilized on a solid 
support. In the deterministic library, one knows a priori a particular unit's 
location on each solid support. In a random library, the location of a particular 

30 unit is not known a priori although each site still contains a single unique unit. 
Many methods for preparing libraries are known to those of skill in this art (see, 
e.g., G ysen et al., Proc. Natl. Acad. Sci. USA, 81:3998-4002 (1984), 
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Houghten et aL, Proc. Natl. Acad. Sci. USA, 81:5131-5135 (1985)). 
Combinatorial library generated by the any techniques known to those of skill in 
the art are contemplated (see, e.g.. Table 1 of Schultz and Schuftz, Biotechnol. 
Prog., 12(61 :729-43 (1996)) for screening; Bartel etaL, Science, 261 :141 1- 
5 1418 (1993); Baumbach etaL BioPharm, (May) : 24-3 5 (1992); Bock etaL 
Nature, 355 :564-566 (1992); Borman, S„ Combinatorial chemists focus on 
small molecules molecular recognition, and automation, Chem. Eng. News, 
2(121:29 (1996); Boublik, et aL, Eukaryotic Virus Display: Engineering the Major 
Surface Glycoproteins of the Autographa California Nuclear Polyhedrosis Virus 

10 (ACNPV) for the Presentation of Foreign Proteins on the Virus Surface, 

Bio/Technology, 1.3:1079-1084 (1995); Brenner, et a!. p Encoded Combinatorial 
Chemistry, Proc. Natl. Acad ScL U.S.A., 89:5381-5383 (1992); Caflisch, et aL, 
Computational Combinatorial Chemistry for De Novo Ligand Design: Review and 
Assessment, Perspect. Drug Discovery Des. r 3:51-84 (1995); Cheng, et aL, 

15 Sequence-Selective Peptide Binding with a Peptido-A,B-f/a/JS-steroidal Receptor 
Selected from an Encoded Combinatorial Library, J. Am. Chem. Soc, 1 1 8 :181 3- 
1814 (1996); Chu, et aL, Affinity Capillary Electrophoresis to Identify the 
Peptide in A Peptide Library that Binds Most Tightly to Vancomycin, J. Org. 
Chem., 58:648-652 (1993); Clackson, et aL, Making Antibody Fragments Using 

20 Phage Display Libraries, Nature, 352 :624-628 (1991); Combs, et aL, Protein 
Structure-Based Combinatorial Chemistry: Discovery of Non-Peptide Binding 
Elements to Src SH3 Domain, J. Am. Chem. Soc, 1 18 :287-288 (1996); Cwirla, 
et aL, Peptides On Phage: A Vast Library of Peptides for Identifying Ligands, 
Proc. Natl. Acad. ScL U.S.A., 87:6378-6382 (1990); Ecker, et aL, Combinatorial 

25 Drug Discovery: Which Method will Produce the Greatest Value, 

Bio/Technology, 13:351-360 (1995); Ellington, et aL, In Vitro Selection of RNA 
Molecules That Bind Specific Ligands, Nature, 346 :818-822 (1990); EMman, 
J.A., Variants of Benzodiazepines, J. Am. Chem. Soc, 1 14 :10997 (1992); 
Erickson, et aL, 77?e Proteins', Neurath, H., Hill, R.L., Eds.: Academic: New York, 

30 1976; pp. 255-257; Felici, et aL, J. MoL BioL, 222 :301-310 (1991); Fodor, et 
aL, Light-Directed, Spatially Addressable Parallel Chemical Synthesis, Science, 
251:767-773 (1991); Francisco, et aL, Transport and Anchoring of Beta- 



RECTIFIED SHEET (RULE 91) 



WO 01/57194 



PCT/USO 1/034 71 



-94- 

Lactamase to the External Surface of E. Co//., Proc. Natl. Acad. Sci. U.S.A., 
89:2713-2717 (1992); Georgiou, et al.. Practical Applications of Engineering 
Gram-Negative Bacterial Cell Surfaces, TIBTECH, 11:6-10 (1993); Geysen, et al.. 
Use of peptide synthesis to probe viral antigens for epitopes to a resolution of a 
5 single amino acid, Proc. Natl. Acad. Sci. U.S.A., 81:3998-4002 (1984); Glaser, 
et al., Antibody Engineering by Condon-Based Mutagenesis in a Filamentous 
Phage Vector System, J. Immunol., 149 :3903-3913 (1992); Gram, et al., In 
vitro selection and affinity maturation of antibodies from a naive combinatorial 
immunoglobulin. library, Proc. Natl. Acad. Sci. f 89:3576-3580 (1992); Han, et 

10 al., Liquid-Phase Combinatorial Synthesis, Proc. Natl. Acad. Sci. U.S.A., 
92:6419-6423 (1995); Hoogenboom, et al., Multi-Subunit Proteins on the 
Surface of Filamentous Phage: Methodologies for Displaying Antibody (Fab) 
Heavy and Light Chains, Nucleic Acids Res., 1_9:41 33-41 37 (1991); Houghten, 
et al., General Method for the Rapid Solid-Phase Synthesis of Large Numbers of 

15 Peptides: Specificity of Antigen-Antibody Interaction at the Level of Individual 
Amino Acids, Proc. Natl. Acad. Sci. U.S.A., 82:5131-5135 (1985); Houghten, 
et al., The Use of Synthetic Peptide Combinatorial Libraries for the Determination 
of Peptide Ligands in Radio-Receptor Assays-Opiod-Peptides, Bioorg. Med. 
. Chem. Lett., 3:405-412 (1993); Houghten, et al.. Generation and Use of 

20 Synthetic Peptide Combinatorial Libraries for Basic Research and Drug Discovery, 
Nature, 354 :84-86 (1991); Huang, et al.. Discovery of New Ligand Binding 
Pathways in Myoglobin by Random Mutagenesis, Nature Struct. Biol. , 1:226-229 
(1994); Huse, et al., Generation of a Large Combinatorial Library of the 
Immunoglobulin Repertoire In Phage Lambda, Science, 246 :1275-1281 (1989); 

25 Janda, K.D., New Strategies for the Design of Catalytic Antibodies, Biotechnol. 
Prog., 6:178-181 (1990); Jung, et al.. Multiple Peptide Synthesis Methods and 
Their Applications, Angew. Chem. Int. Ed. Engl., 31:367-486 (1992); Kang, et 
al., Linkage of Recognition and Replication Functions By Assembling 
Combinatorial Antibody Fab Libraries Along Phage Surfaces, Proc. Natl. Acad. 

30 Sci. U.S.A., 88:4363-4366 (1991a); Kang, et al.. Antibody Redesign by Chain 
Shuffling from Random Combinatorial Immunoglobulin Libraries, Proc. Natl. 
Acad. Sci. U.S.A., 88:1 1 120-1 1 123 (1991b); Kay, et al., An Ml 3 Phage Library 
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Displaying Random 38-Amino-Acid-Peptides as a Source of Novel Sequences 
with Affinity to Selected Targets Genes, Gene, 128 :59-65 (1993); Lam, et al., A 
new type of synthetic peptide library for identifying ligand-binding activity, 
Nature, 354 :82-84 (1991) (published errata apear in Nature, 358 :434 (1992) 
5 and Nature, 360 :768 (1992); Lebl, et al.. One Bead One Structure Combinatorial 
Libraries, Biopofymers (Pept. SciJ, 37:177-198 (1995); Lerner, et al., Antibodies 
without Immunization, Science, 258 :131 3-1314 (1992); Li, et al.. Minimization 
of a Polypeptide Hormone, Science, 270:1657-1660 (1995); Light, et al., 
Display of Dimeric Bacterial Alkaline Phosphatase on the Major Coat Protein of 
10 Filamentous Bacteriophage, Bioorg. Med. Chem. Lett., 3:1073-1079 (1992); 
Little, et al.. Bacterial Surface Presentation of Proteins and Peptides: An 
Alternative to Phage Technology, Trends BiotechnoL, 1J_:3-5 (1993); Marks, et 
al., By-Passing Immunization. Human Antibodies from V-Gene Libraries 
Displayed on Phage, J. Mot. Bioi. , 222:581-597 (1991); Matthews, et al., 

15 Substrate Phage: Selection of Protease Substrates by Monovalent Phage Display, 
Science, 260 :1 113-1 117 (1993); McCafferty, et al., Phage Enzymes: Expression 
and Affinity Chromatography of Functional Alkaline Phosphatase on the Surface 
of Bacteriophage, Protein Eng., 4:955-961 (1991); Menger, et al., Phosphatase 
Catalysis Developed Via Combinatorial Organic Chemistry/ J- Org. Chem., 

20 60:6666-6667 (1995); Nicolaou, et al., Angew. Chem. int. Ed. EngL, 34:2289- 
2291 (1995); Oldenburg, et al., Peptide Ligands for A Sugar-Binding Protein 
Isolated from a Random Peptide Library, Proc. Nati. Acad. Sci. U.S.A., 89:5393- 
5397 (1992); Parmley, et al., Antibody-Selectable Filamentous fd Phage Vectors: 
Affinity Purification of Target Genes, Genes, 73:305-318 (1988); Pinilla, et al., 

25 Synthetic Peptide Combinatorial Libraries (SPCLS)-ldentification of the Antigenic 
Determinant of Beta-Endorphin Recognized by Monoclonal Antibody-3E7, Gene, 
128 :71-76 (1993); Pinilla, et al.. Review of the Utility of Soluble Combinatorial 
Libraries, Biopoiymers, 37:221-240 (1995); Pistor, et al., Expression of Viral 
Hemegglutinan On the Surface of £1 Colh, Kiin. Wochenschr., 66:110-1 16 

30 (1989); Pollack, et al.. Selective Chemical Catalysis by an Antibody, Science, 

234:1570-1 572 (1986); Rigler, et al., Fluorescence Correlations, Single Molecule 
Detection and Large Number Screening: Applications in Biotechnology, J. 
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Biotechnol., 41:177-186 (1995); Sarvetnick, et al., Increasing the Chemical 
Potential of the Germ-Line Antibody Repertoire, Proc. Nat/. Acad. Sci. U.S.A., 
90:4008-401 1 (1993); Sastry, et ah, Cloning of the Immunological Repertiore in 
Escherichia Co// for Generation of Monoclonal Catalytic Antibodies: Construction 
5 of a Heavy Chain Variable Region-Specific cDNA Library, Proc. Natl. Acad. Set. 
U.S.A., 86:5728-5732 (1989); Scott, et al.. Searching for Peptide Ligands with 
an Epitope Library, Science, 249 :386-390 (1990); Sears, et al., Engineering 
Enzymes for Bioorganic Synthesis: Peptide Bond Formation, Biotechnoi. Prog., 
12:423-433 (1996); Simon, et. al.. Peptides: A Modular Approach to Drug 

10 Discovery, Proc. Nati. Acad. Sci. U.S.A., 89:9367-9371 (1992); Still, et al., 

Discovery of Sequence-Selective Peptide Binding by Synthetic Receptors Using 
Encoded Combinatorial Libraries, Acc. Chem. Res., 29:1 55-1 63 (1996); 
Thompson, et al.. Synthesis and Applications of Small Molecule Libraries, Chem. 
Rev., 96:555-600 (1996); Tramontano, et al.-. Catalytic Antibodies, Science, 

15 234 :1566-1570 (1986); Wrighton, et al.. Small Peptides as Potent Mimetics of 
the Protein Hormone Erythropoietin, Science, 273 :458-464 (1996); York, et al.. 
Combinatorial mutagenesis of the reactive site region in plasminogen activator 
inhibitor I, J. Biol. Chem., 266 :8595-8600 (1991); Zebedee, et al.. Human 
Combinatorial Antibody Libraries to Hepatitis B Surface Antigen, Proc. Nati. 

20 Acad. Sci. U.S.A., 89:3175-3179 (1992); Zuckerman, et al., Identification of 
Highest-Affinity Ligands by Affinity Selection from Equimolar Peptide Mixtures 
Generated by Robotic Synthesis, Proc. Natl. Acad. Sci. U.S.A., 89:4505-4509 
(1992). 

For example, peptides that bind to an MTSP protein or a protease domain 
25 of an MTSP protein can be identified using phage display libraries. In an 

exemplary embodiment, this method can include a) contacting phage from a 
phage library with the MTSP protein or a protease domain thereof; (b) isolating 
phage that bind to the protein; and (c) determining the identity of at least one 
peptide coded by the isolated phage to identify a peptide that binds to an MTSP 
30 protein. 
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H. Modulat rs of th activity of MTSP proteins 

Provided herein are compounds, identified by screening or produced using 
the MTSP proteins or protease domain in other screening methods, that 
modulate the activity of an MTSP. These compounds act by directly interacting 
5 with the MTSP protein or by altering transcription or translation thereof. Such 
molecules include, but are not limited to, antibodies that specifically react with 
an MTSP protein, particularly with the protease domain thereof, antisense 
nucleic acids that alter expression of the MTSP protein, antibodies, peptide 
mimetics and other such compounds. 
0 1 . Antibodies 

Antibodies, including polyclonal and monoclonal antibodies, that 
specifically bind to the MTSP proteins provided herein, particularly to the single 
chain protease domains thereof are provided. Preferably, the antibody is a 
monoclonal antibody, and preferably, the antibody specifically binds to the 
protease domain of the MTSP protein. In particular embodiments, antibodies to 
each of the single chain of protease domain of MTSP1 , MTSP3, MTSP4 and 
MTSP6. Also provided are antibodies that specifically bind to any domain of 
MTSP3 or MTSP4, and to double chain forms thereof. 

The MTSP protein and domains, fragments, homologs and derivatives 
thereof may be used as immunogens to generate antibodies that specifically bind 
such immunogens. Such antibodies include but are not limited to polyclonal, 
monoclonal, chimeric, single chain, Fab fragments, and an Fab expression 
library. In a specific embodiment, antibodies to human MTSP protein are 
produced. In another embodiment, complexes formed from fragments of MTSP 
protein, which fragments contain the serine protease domain, are used as 
immunogens for antibody production. 

Various procedures known in the art may be used for the production of 
polyclonal antibodies to MTSP protein, its domains, derivatives, fragments or 
analogs. For production of the antibody, various host animals can be immunized 
by injection with the native MTSP protein or a synthetic version, or a derivative 
of the foregoing, such as a cross-linked MTSP protein. Such host animals 
include but are not limited to rabbits, mice, rats, etc. Various adjuvants can be 
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used to increase the immunological response, depending on the host species, 
and include but are not limited to Freund's (complete and incomplete), mineral 
gels such as aluminum hydroxide, surface active substances such as 
lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, dinitrophenol, 
5 and potentially useful human adjuvants such as bacille Calmette-Guerin (BCG) 
and corynebacterium parvum. 

For preparation of monoclonal antibodies directed towards an MTSP 
protein or domains, derivatives, fragments or analogs thereof, any technique that 
provides for the production of antibody molecules by continuous cell lines in 

10 culture may be used. Such techniques include but are not restricted to the 
hybridoma technique originally developed by Kohler and Milstein [Nature 
256 :495-497 (1975)), the trioma technique, the human B-cell hybridoma 
technique (Kozbor et al., immunology Today 4:7 '2 (1983)), and the EBV 
hybridoma technique to produce human monoclonal antibodies (Cole et al., in 

15 Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96 

(1985)). In an additional embodiment, monoclonal antibodies can be produced in 
germ-free animals utilizing recent technology (PCT/US90/02545). Human 
antibodies may be used and can be obtained by using human hybridomas (Cote 
et al., Proc. Natl. Acad. Sci. USA 80:2026-2030 (1983)). Or by transforming 

20 human B cells with EBV virus in vitro (Cole et al., in Monoclonal Antibodies and 
Cancer Therapy, Alan R. Liss, Inc., pp. 77-96 (1985)). Techniques developed 
for the production of "chimeric antibodies" (Morrison et al., Proc. Natl. Acad. 
Sci. USA 81:6851-6855 (1984); Neuberger et al.. Nature 312 :604-608 (1984); 
Takeda et al.. Nature 314 :452-454 (1985)) by splicing the genes from a mouse 

25 antibody molecule specific for the MTSP protein together with genes from a 
human antibody molecule of appropriate biological activity can be used. 

Techniques described for the production of single chain antibodies (U.S. 
patent 4,946,778) can be adapted to produce MTSP protein-specific single chain 
antibodies. An additional embodiment uses the techniques described for the 

30 construction of Fab expression libraries (Huse et al., Science 246 :1275-1281 

(1989)) to allow rapid and easy identification of monoclonal Fab fragments with 
the desired specificity for MTSP protein or MTSP protein, or domains, 
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derivatives, or analogs thereof. Non-human antibodies can be "humanized" by 
known methods {see, e.g., U.S. Patent No. 5,225,539). 

Antibody fragments that contain the idiotypes of MTSP protein can be 
generated by techniques known in the art. For example, such fragments include 
5 but are not limited to: the F(ab')2 fragment which can be produced by pepsin 
digestion of the antibody molecule; the Fab' fragments that can be generated by 
reducing the disulfide bridges of the F(ab')2 fragment, the Fab fragments that 
can be generated by treating the antibody molecular with papain and a reducing 
agent, and Fv fragments. 

10 In the production of antibodies, screening for the desired antibody can be 

accomplished by techniques known in the art, e.g., ELISA (enzyme-linked 
immunosorbent assay). To select antibodies specific to a particular domain of 
the MTSP protein one may assay generated hybridomas for a product that binds 
to the fragment of the MTSP protein that contains such a domain 

15 The foregoing antibodies can be used in methods known in the art 

relating to the localization and/or quantitation of MTSP proteins, e.g., for imaging 
these proteins, measuring levels thereof in appropriate physiological samples, in 
diagnostic methods, etc. 

In another embodiment, (see infra), anti-MTSP protein antibodies, or 

20 fragments thereof, containing the binding domain are used as therapeutic agents. 

2. Peptides and Peptide Mimetics 

Provided herein are methods for identifying molecules that bind to and 
modulate the activity of MTSP proteins. Included among molecules that bind to 
MTSPs, particularly the single chain protease domain or catalytically active 

25 fragments thereof, are peptides and peptide mimetics. Peptide mimetics are 

molecules or compounds that mimic the necessary molecular conformation of a 
ligand or polypeptide for specific binding to a target molecule such as, e.g., an 
MTSP protein. In an exemplary embodiment, the peptides or peptide mimetics 
bind to the protease domain of the MTSP protein. Such peptides and peptide 

30 mimetics include those of antibodies that specifically bind an MTSP protein and, 
preferably, bind to the protease domain of an MTSP protein. The peptides and 
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peptide mimetics identified by methods provided herein can be agonists or 
antagonists of MTSP proteins. 

Such peptides and peptide mimetics are useful for diagnosing, treating, 
preventing, and screening for a disease or disorder associated with MTSP protein 
5 activity in a mammal. In addition, the peptides and peptide mimetics are useful 
for identifying, isolating, and purifying molecules or compounds that modulate 
the activity of an MTSP protein, or specifically bind to an MTSP protein, 
preferably, the protease domain of an MTSP protein. Low molecular weight 
peptides and peptide mimetics can have strong binding properties to a target 
10 molecule, e.g., an MTSP protein or, preferably, to the protease domain of an 
MTSP protein. 

Peptides and peptide mimetics that bind to MTSP proteins as described 
herein can be administered to mammals, including humans, to modulate MTSP 
protein activity. Thus, methods for therapeutic treatment and prevention of 

1 5 neoplastic diseases comprise administering a peptide or peptide mimetic 

compound in an amount sufficient to modulate such activity are provided. Thus, 
also provided herein are methods for treating a subject having such a disease or 
disorder in which a peptide or peptide mimetic compound is administered to the 
subject in a therapeutically effective dose or amount. 

20 Compositions containing the peptides or peptide mimetics can be 

administered for prophylactic and/or therapeutic treatments. In therapeutic 
applications, compositions can be administered to a patient already suffering 
from a disease, as described above, in an amount sufficient to cure or at least 
partially arrest the symptoms of the disease and its complications. Amounts 

25 effective for this use will depend on the severity of the disease and the weight 
and general state of the patient. 

4 

In prophylactic applications, compositions containing the peptides and 
peptide mimetics are administered to a patient susceptible to or otherwise at risk 
of a particular disease. Such an amount is defined to be a "prophylactically 
30 effective dose". In this use, the precise amounts again depend on the patient's 
state of health and weight. 
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Accordingly, the peptides and peptide mimetics that bind to an MTSP 
protein can be used generating pharmaceutical compositions containing, as an 
active ingredient, at least one of the peptides or peptide mimetics in association 
with a pharmaceutical carrier or diluent. The compounds can be administered, for 
5 example, by oral, pulmonary, parental (intramuscular, intraperitoneal, intravenous 
(IV) or subcutaneous injection), inhalation (via a fine powder formulation), 
transdermal, nasal, vaginal, rectal, or sublingual routes of administration and can 
be formulated in dosage forms appropriate for each route of administration (see, 
e.g., International PCT application Nos. WO 93/25221 and WO 94/17784; and 

10 European Patent Application 613,683). 

Peptides and peptide mimetics that bind to MTSP proteins are useful in 
vitro as unique tools for understanding the biological role of MTSP proteins, 
including the evaluation of the many factors thought to influence, and be 
influenced by, the production of MTSP protein. Such peptides and peptide 

1 5 mimetics are also useful in the development of other compounds that bind to and 
modulate the activity of an MTSP protein, because such compounds provide 
important information on the relationship between structure and activity that 
should facilitate such development. 

The peptides and peptide mimetics are also useful as competitive binders 

20 in assays to screen for new MTSP proteins or MTSP protein agonists. In such 
assay embodiments, the compounds can be used without modification or can be 
modified in a variety of ways; for example, by labeling, such as covalently or 
non-covalently joining a moiety which directly or indirectly provides a detectable 
signal. In any of these assays, the materials thereto can be labeled either 

25 directly or indirectly. Possibilities for direct labeling include label groups such as: 
radiolabels such as 125 l enzymes (U.S. Pat. No. 3,645,090) such as peroxidase 
and alkaline phosphatase, and fluorescent labels (U.S. Pat. No. 3,940,475) 
capable of monitoring the change in fluorescence intensity, wavelength shift, or 
fluorescence polarization. Possibilities for indirect labeling include biotinylation of 

30 one constituent followed by binding to avidin coupled to one of the above label 
groups. The compounds may also include spacers or linkers in cases where the 
compounds are to be attached to a solid support. 
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Moreover, based on their ability to bind to an MTSP protein, the peptides 
and peptide mimetics can be used as reagents for detecting MTSP proteins in 
living cells, fixed cells, in biological fluids, in tissue homogenates, in purified, 
natural biological materials, etc. For example, by labelling such peptides and 

5 peptide mimetics, one can identify cells having MTSP proteins. In addition, 
based on their ability to bind an MTSP protein, the peptides and peptide 
mimetics can be used in situ staining, FACS (fluorescence-activated cell sorting). 
Western blotting, ELISA, etc. In addition, based on their ability to bind to an 
MTSP protein, the peptides and peptide mimetics can be used in purification of 

0 MTSP protein polypeptides or in purifying cells expressing the MTSP protein 
polypeptides, e.g. , a polypeptide encoding the protease domain of an MTSP 
protein. 

The peptides and peptide mimetics can also be used as commercial 
reagents for various medical research and diagnostic uses. 
5 The activity of the peptides and peptide mimetics can be evaluated either 

in vitro or in vivo in one of the numerous models described in McDonald (1 992) 
Am. J. of Pediatric Hematology/Oncology. 74:8-21 , which is incorporated herein 
by reference. 

3. Peptide and peptide mimetic therapy 

Peptides and peptide mimetics that can bind to MTSP proteins or the 
protease domain of MTSP proteins and modulate the activity thereof, or have 
MTSP protein activity, can be used for treatment of neoplastic diseases. The 
peptides and peptide mimetics may be delivered, in vivo or ex vivo, to the cells 
of a subject in need of treatment. Further, peptides which have MTSP protein 
activity can be delivered, in vivo or ex vivo, to cells which carry mutant or 
missing alleles encoding the MTSP protein gene. Any of the techniques 
described herein or known to the skilled artisan can be used for preparation and 
in vivo or ex vivo delivery of such peptides and peptide mimetics that are 
substantially free of other human proteins. For example, the peptides can be 
readily prepared by expression in a microorganism or synthesis in vitro. 

The peptides or peptide mimetics can be introduced into cells, in vivo or 
ex vivo, by microinjection or by use of liposomes, for example. Alternatively, the 
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peptides or peptide mimetics may be taken up by cells, in vivo or ex vivo, 
actively or by diffusion. In addition, extracellular application of the peptide or 
peptide mimetic may be sufficient to effect treatment of a neoplastic disease. 
Other molecules, such as drugs or organic compounds, that: 1 ) bind to an 
5 MTSP protein or protease domain thereof; or 2) have a similar function or 

activity to an MTSP protein or protease domain thereof, may be used in methods 
for treatment. 

4. Rational drug design 

The goal of rational drug design is to produce structural analogs of 
10 biologically active polypeptides or peptides of interest or of small molecules or 
peptide mimetics with which they interact (e.g., agonists, antagonists, inhibitors) 
in order to fashion drugs which are, e.g., more active or stable forms thereof; or 
which, e.g., enhance or interfere with the function of a polypeptide in vivo (e.g., 
an MTSP protein). In one approach, one first determines the three-dimensional 
15 structure of a protein of interest (e.g., an MTSP protein or polypeptide having a 
protease domain) or, for example, of a MTSP protein-Iigand complex, by X-ray 
crystallography, by computer modeling or most typically, by a combination of 
approaches (see, e.g., Erickson et ai. 1990). Also, useful information regarding 
the structure of a. polypeptide may be gained by modeling based on the structure 
20 of homologous proteins. In addition, peptides can be analyzed by an alanine 

scan. In this technique, an amino acid residue is replaced by Ala, and its effect 
on the peptide's activity is determined. Each of the amino acid residues of the 
peptide is analyzed in this manner to determine the important regions of the 
peptide. 

25 Also, a polypeptide or peptide that binds to an MTSP protein or, 

preferably, the protease domain of an MTSP protein, can be selected by a 
functional assay, and then the crystal structure of this polypeptide or peptide 
can be determined. The polypeptide can be, for example, an antibody specific 
for an MTSP protein or the protein domain of an MTSP protein. This approach 

30 can yield a pharmacore upon which subsequent drug design can be based. 
Further, it is possible to bypass the crystallography altogether by generating 
anti-idiotypic polypeptides or peptides, (anti-ids) to a functional, 
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pharmacologically active polypeptide or peptide that binds to an MTSP protein or 
protease domain of an MTSP protein. As a mirror image of a mirror image, the 
binding site of the anti-ids is expected to be an analog of the original target 
molecule, e.g., an MTSP protein or polypeptide having an MTSP protein. The 
5 anti-id could then be used to identify and isolate peptides from banks of 

chemically or biologically produced banks of peptides. Selected peptides would 
then act as the pharmacore. 

Thus, one may design drugs which have, e.g., improved activity or 
stability or which act as modulators (e.g., inhibitors, agonists, antagonists, etc.) 

10 of MTSP protein activity, and are useful in the methods, particularly the methods 
for diagnosis, treatment, prevention, and screening of a neoplastic disease. By 
virtue of the availability of cloned MTSP protein sequences, sufficient amounts 
of the MTSP protein polypeptide may be made available to perform such 
analytical studies as X-ray crystallography. In addition, the knowledge of the 

1 5 amino acid sequence of an MTSP protein or the protease domain thereof, e.g., 

the protease domain encoded by the amino acid sequence of SEQ ID NO: 2, can 

provide guidance on computer modeling techniques in place of, or in addition to, 

X-ray crystallography. 

Methods of identifying peptides and peptide mimetics that bind to 
20 MTSP proteins 

Peptides having a binding affinity to the MTSP protein polypeptides 

provided herein (e.g., an MTSP protein or a polypeptide having a protease 

domain of an MTSP protein) can be readily identified, for example, by random 

peptide diversity generating systems coupled with an affinity enrichment 

25 process. Specifically, random peptide diversity generating systems include the 
"peptides on plasmids" system (see, e.g., U.S. Patent Nos. 5,270,170 and 
5,338,665); the "peptides on phage" system (see, e.g., U.S. Patent No. 
6,121,238 and Cwirla,ef <?/. (1990) Proc. Natl. Acad. Sci. U.S.A. 
57:6378-6382); the "polysome system;" the "encoded synthetic library (ESL) n 

30 system; and the "very large scale immobilized polymer synthesis" system (see, 
e.g., U.S. Patent No. 6,121,238; and Dower et al. (1991) An. Rep. Med. Chem. 
26:271-280 
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For example, using the procedures described above, random peptides can 
generally be designed to have a defined number of amino acid residues in length 
(e.g., 12). To generate the collection of oligonucleotides encoding the random 
peptides, the codon motif (NNK)x, where N is nucleotide A, C, G, or T 
5 (equimolar; depending on the methodology employed, other nucleotides can be 
employed), K is G or T (equimolar), and x is an integer corresponding to the 
number of amino acids in the peptide (e.g., 12) can be used to specify any one 
of the 32 possible codons resulting from the NNK motif: 1 for each of 12 amino 
acids, 2 for each of 5 amino acids, 3 for each of 3 amino acids, and only one of 

10 the three stop codons. Thus, the NNK motif encodes all of the amino acids, 
encodes only one stop codon, and reduces codon bias. 

The random peptides can be presented, for example, either on the surface 
of a phage particle, as part of a fusion protein containing either the pill or the 
pVIII coat protein of a phage fd derivative (peptides on phage) or as a fusion 

15 protein with the Lacl peptide fusion protein bound to a plasmid (peptides on 

plasmids). The phage or plasmids, including the DNA encoding the peptides, can 
be identified and isolated by an affinity enrichment process using immobilized 
MTSP protein polypeptide having a protease domain. The affinity enrichment 
process, sometimes called "panning," typically involves multiple rounds of 

20 incubating the phage, plasmids, or polysomes with the immobilized MTSP protein 
polypeptide, collecting the phage, plasmids, or polysomes that bind to the MTSP 
protein polypeptide (along with the accompanying DNA or mRNA), and 
producing more of the phage or plasmids (along with the accompanying 
Lacl-peptide fusion protein) collected. 

25 Characteristics of peptides and peptide mimetics 

Typically, the molecular weight of preferred peptides or peptide mimetics 
is from about 250 to about 8,000 daltons. If the peptides are oligomerized, 
dimerized and/or derivatized with a hydrophilic polymer (e.g., to increase the 
affinity and/or activity of the compounds), the molecular weights of such 

30 peptides can be substantially greater and can range anywhere from about 500 to 
about 1 20,000 daltons, more preferably from about 8,000 to about 80,000 
daltons. Such peptides can comprise 9 or more amino acids wherein the amino 
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acids are naturally occurring or synthetic (non-naturally occurring) amino acids. 
One skilled in the art would know how to determine the affinity and molecular 
weight of the peptides and peptide mimetics suitable for therapeutic and/or 
diagnostic purposes {e.g., see Dower et at., U.S. Patent No. 6,121,238). 
5 The peptides may be covalently attached to one or more of a variety of 

hydrophilic polymers. Suitable hydrophilic. polymers include, but are not limited 
to, polyalkylethers as exemplified by polyethylene glycol and polypropylene 
glycol, polylactic acid, polyglycolic acid, polyoxyalkenes, polyvinylalcohol, 
polyvinylpyrrolidone, cellulose and cellulose derivatives, dextran and dextran 

10 derivatives, etc. When the peptide compounds are derivatized with such 

polymers, their solubility and circulation half-lives can be increased with little, if 
any, diminishment in their binding activity. The peptide compounds may be 
dimerized and each of the dimeric subunits can be covalently attached to a 
hydrophilic polymer. The peptide compounds can be PEGylated, i.e., covalently 

15 attached to polyethylene glycol (PEG). 

Peptide analogs are commonly used in the pharmaceutical industry as 
non-peptide drugs with properties analogous to those of the template peptide. 
These types of non-peptide compounds are termed "peptide mimetics" or 
"peptidomimetics" (Luthman etaL, A Textbook of Drug Design and 

20 Development, 14:386-406, 2nd Ed., Harwood Academic Publishers (1996); 

Joachim Grante (1 994) Angew. Chem. int. Ed. Engl., 33:1699-1720; Fauchere 
(1986) J. Adv. Drug Res., 75:29; Veber and Freidinger (1985) TINS, p. 392; and 
Evans eta/. (1987) J. Med. Chem. 30:1229). Peptide mimetics that are 
structurally similar to therapeutically useful peptides may be used to produce an 

25 equivalent or enhanced therapeutic or prophylactic effect. Preparation of 

peptidomimetics and structures thereof are known to those of skill in this art. 

Systematic substitution of one or more amino acids of a consensus 
sequence with a D-amino acid of the same type [e.g., D-lysine in place of 
L-lysine) may be used to generate more stable peptides. In addition, constrained 

30 peptides containing a consensus sequence or a substantially identical consensus 
sequence variation may be generated by methods known in the art (Rizo et al. 
(1992) An. Rev. Biochem., 57:387, incorporated herein by reference); for 
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example, by adding interna! cysteine residues capable of forming intramolecular 
disulfide bridges which cyclize the peptide. 

Those skilled in the art would appreciate that modifications may be made 
to the peptides and mimetics without deleteriously effecting the biological or 
5 functional activity of the peptide. Further, the skilled artisan would know how to 
design non-peptide structures in three dimensional terms, that mimic the 
peptides that bind to a target molecule, e.g., an MTSP protein or, preferably, the 
protease domain of MTSP proteins {see, e.g., Eck and Sprang (1989) J. Biol. 
Chem., 26: 17605-18795). 
0 When used for diagnostic purposes, the peptides and peptide mimetics 

may be labeled with a detectable label and, accordingly, the peptides and 
peptide mimetics without such a label can serve as intermediates in the 
preparation of labeled peptides and peptide mimetics. Detectable labels can be 
molecules or compounds, which when covalently attached to the peptides and 
5 peptide mimetics, permit detection of the peptide and peptide mimetics in vivo, 
for example, in a patient to whom the peptide or peptide mimetic has been 
administered, or in vitro, e.g., in a sample or cells. Suitable detectable labels are 
well known in the art and include, by way of example, radioisotopes, fluorescent 
labels (e.g., fluorescein), and the like. The particular detectable label employed 
is not critical and is selected relative to the amount of label to be employed as 
well as the toxicity of the label at the amount of label employed. Selection of 
the label relative to such factors is well within the skill of the art. 

Covalent attachment of a detectable label to the peptide or peptide 
mimetic is accomplished by conventional methods well known in the art. For 
example, when the 125 l radioisotope is employed as the detectable label, covalent 
attachment of 125 l to the peptide or the peptide mimetic can be achieved by 
incorporating the amino acid tyrosine into the peptide or peptide mimetic and 
then iodinating the peptide (see, e.g., Weaner et al. (1994) Synthesis and 
Appiications of isotopicaiiy Labelled Compounds, pp. 137-140). If tyrosine is 
not present in the peptide or peptide mimetic, incorporation of tyrosine to the N 
or C terminus of the peptide or peptide mimetic can be achieved by well known 
chemistry. Likewise, M P can be incorporated onto the peptide or peptide 
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mimetic as a phosphate moiety through, for example, a hydroxyl group on the 
peptide or peptide mimetic using conventional chemistry. 

Labeling of peptidomimetics usually involves covalent attachment of one 
or more labels, directly or through a spacer {e.g., an amide group), to 
5 non-interfering position(s) on the peptidomimetic that are predicted by 
quantitative structure-activity data and/or molecular modeling. Such 
non-interfering positions generally are positions that do not form direct contacts 
with the macromolecules(s) to which the peptidomimetic binds to produce the 
therapeutic effect. Derivatization (e.g., labeling) of peptidomimetics should not 

4 

10 substantially interfere with the desired biological or pharmacological activity of 
the peptidomimetic. 

6. Methods of preparing peptides and peptide mimetics 
Peptides that bind to MTSP proteins can be prepared by classical methods 
known in the art, for example, by using standard solid phase techniques. The 

15 standard methods include exclusive solid phase synthesis, partial solid phase 
synthesis methods, fragment condensation, classical solution synthesis, and 
even by recombinant DNA technology (see, e.g., Merrifield (1963) J. Am. Chem. 
Soc, 55:2149, incorporated herein by reference.) 

Using the "encoded synthetic library" or "very large scale immobilized 

20 polymer synthesis" systems (see, e.g., U.S. Patent No. 5,925,525, and 

5,902,723); one can not only determine the minimum size of a peptide with the 
activity of interest, one can also make all of the peptides that form the group of 
peptides that differ from the preferred motif (or the minimum size of that motif) 
in one, two, or more residues. This collection of peptides can then be screened 

25 for ability to bind to the target molecule, e.g., and MTSP protein or, preferably, 
the protease domain of an MTSP protein. This immobilized polymer synthesis 
system or other peptide synthesis methods can also be used to synthesize 
truncation analogs and deletion analogs and combinations of truncation and 
deletion analogs of the peptide compounds. 

30 These procedures can also be used to synthesize peptides in which amino 

acids other than the 20 naturally occurring, genetically encoded amino acids are 
substituted at one, two, or more positions of the peptide. For instance, 
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naphthyfalanine can be substituted for tryptophan, facilitating synthesis. Other 

synthetic amino acids that can be substituted into the peptides include 

L-hydroxypropyl, L-3, 4-dihydroxy-phenylalanyI, d amino acids such as 

L-d-hydroxylysyl and D-d-methylalanyl, L-a-methylalanyl, fi amino acids, and 
5 isoquinolyl. D amino acids and non-naturally occurring synthetic amino acids 

can also be incorporated into the peptides (see, e.g., Roberts eta/. (1983) 

Unusual Amino/ Acids in Peptide Synthesis, 5(6):341-449). 

The peptides may also be modified by phosphorylation (see, e.g., W. 

Bannwarth eta/. (1996) Biorganic and Medicinal Chemistry Letters, 
10 5(17):2141-2146), and other methods for making peptide derivatives (see, e.g., 

Hruby eta/. (1990) Biochem. J., 26S(2):249-262). Thus, peptide compounds 

also serve as a basis to prepare peptide mimetics with similar biological activity. 
Those of skill in the art recognize that a variety of techniques are 

available for constructing peptide mimetics with the same or similar desired 
15 biological activity as the corresponding peptide compound but with more 

favorable activity than the peptide with respect to solubility, stability, and 

susceptibility to hydrolysis and proteolysis (see, e.g., Morgan eta/. (1989) An. 

Rep. Med. Chem., 24:243-252). Methods for preparing peptide mimetics 

modified at the ISNterminal amino group, the C-terminal carboxyl group, and/or 
20 changing one or more of the amido linkages in the peptide to a non-amido 

linkage are known to those of skill in the art. 

Amino terminus modifications include alkylating, acetylating, adding a 

carbobenzoyl group, forming a succinimide group, etc. (see, e.g., Murray eta/. 

(1 9,95) Burger's Medicinal Chemistry and Drug Discovery, 5th ed., Vol. 1, 
25 Manfred E. Wolf, ed., John Wiley and Sons, Inc.). C-terminal modifications 

include mimetics wherein the C-terminal carboxyl group is replaced by an ester, 

an amide or modifications to form a cyclic peptide. 

In addition to N-terminal and C-terminal modifications, the peptide 

compounds, including peptide mimetics, can advantageously be modified with or 
30 covalently coupled to one or more of a variety of hydrophilic polymers. It has 

been found that when* peptide compounds are derivatized with a hydrophilic 

polymer, their solubility and circulation half-lives may be increased and their 
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immunogenicity is masked, with little, if any, diminishment in their binding 
activity. Suitable nonproteinaceous polymers include, but are not limited to, 
polyalkylethers as exemplified by polyethylene glycol and polypropylene glycol, 
polylactic acid, polyglycolic acid, polyoxyalkenes, polyvinylalcohol, 
5 polyvinylpyrrolidone, cellulose and cellulose derivatives, dextran and dextran 
derivatives, etc. Generally, such hydrophilic polymers have an average 
molecular weight ranging from about 500 to about 100,000 daltons, more 
preferably from about 2,000 to about 40,000 daltons and, even more preferably, 
from about 5,000 to about 20,000 daltons. The hydrophilic polymers also can 
10 have an average molecular weights of about 5,000 daltons, 10,000 daltons and 
20,000 daltons. 

Methods for derivatizing peptide compounds or for coupling peptides to 
such polymers have been described (see, e.g., Zallipsky (1995) Bioconjugate 
Chem., 6:150-165; Monfardini eta/. (1995) Bioconjugate Chem., 6:62-69; U.S. 

15 Pat. No! 4,640,835; U.S. Pat. No. 4,496,689; U.S. Pat. No. 4,301,144; U.S. 

Pat. No. 4,670,417; U.S. Pat. No. 4,791,192; U.S. Pat. No. 4,179,337 and WO 
95/34326, all of which are incorporated by reference in their entirety herein). 

Other methods for making peptide derivatives are described, for example, 
in Hruby et af. (1990), Bfochem J., 266{2):249-262, which is incorporated 

20 herein by reference. Thus, the peptide compounds also serve as structural 
models for non-peptidic compounds with similar biological activity. Those of 
skill in the art recognize that a variety of techniques are available for 
constructing compounds with the same or similar desired biological activity as a 
particular peptide compound but with more favorable activity with respect to 

25 solubility, stability, and susceptibility to hydrolysis and proteolysis (see, e.g., 

Morgan eta/. (1989)>to. Rep. Med. Chem., 24:243-252, incorporated herein by 
reference). These techniques include replacing the peptide backbone with a 
backbone composed of phosphonates, amidates, carbamates, sulfonamides, 
secondary amines, and N-methylamino acids. 

30 Peptide compounds may exist in a cyclized form with an intramolecular 

disulfide bond between the thiol groups of the cysteines. Alternatively, an 
intermolecular disulfide bond between the thiol groups of the cysteines can be 
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produced to yield a dimeric (or higher oligomeric) compound. One or more of the 
cysteine residues may also be substituted with a homocysteine. 
I. CONJUGATES 

A conjugate, containing: a) a single chain protease domain (or 
5 proteolytically active portion thereof) of an MTSP protein or an MTSP3, MTSP4 
or MTSP6 full length zymogen, activated form thereof, or double or single chain 
protease domain thereof; and b) a targeting agent linked to the MTSP protein 
directly or via a linker, wherein the agent facilitates: i) affinity isolation or 
purification of the conjugate; ii) attachment of the conjugate to a surface; iii) 
10 detection of the conjugate; or iv) targeted delivery to a selected tissue or cell, is 
provided herein. The conjugate can be a chemical conjugate or a fusion protein 
mixture thereof. 

The targeting agent is preferably a protein or peptide fragment, such as a 
tissue specific or tumor specific monoclonal antibody or growth factor or 

15 fragment thereof linked either directly or via a linker to an MTSP protein or a 

protease domain thereof. The targeting agent may also be a protein or peptide 
fragment that contains a protein binding sequence, a nucleic acid binding 
sequence, a lipid binding sequence, a polysaccharide binding sequence, or a 
metal binding sequence, or a linker for attachment to a solid support. In a 

20 particular embodiment, the conjugate contains a) the MTSP or portion thereof, as 
described herein; and b) a targeting agent linked to the MTSP protein directly or 
via a linker. 

Conjugates, such as fusion proteins and chemical conjugates, of the 
MTSP protein with a protein or peptide fragment (or plurality thereof) 

25 that functions, for example, to facilitate affinity isolation or purification of the 
MTSP protein domain, attachment of the MTSP protein domain to a surface, or 
detection of the MTSP protein domain are provided. The conjugates can be 
produced by chemical conjugation, such as via thiol linkages, but are preferably 
produced by recombinant means as fusion proteins. In the fusion protein, the 

30 peptide or fragment thereof is linked to either the N-terminus or C-terminus of 
the MTSP protein domain. In chemical conjugates the peptide or fragment 
thereof may be linked anywhere that conjugation can be effect d, and there may 
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be a plurality of such peptides or fragments linked to a single MTSP protein 
domain or to a plurality thereof. 

The targeting agent is preferably for in vitro delivery to a cell or tissue, 
and includes agents such as cell or tissue-specific antibodies, growth factors and 
5 other factors expressed on specific cells; and other cell or tissue specific agents 
the promote directed delivery of a linked protein. 

Most preferably the targeting agent specifically delivers the MTSP protein 
to selected cells by interaction with a cell surface protein and internalization of 
conjugate or MTSP protein portion thereof. These conjugate are used in a variety 
10 of methods and are particularly suited for use in methods of activation of 

prodrugs, such as prodrugs that upon cleavage by the particular MTSP protein 
are cytotoxic. The prodrugs are administered prior to simultaneously with or 
subsequently to the conjugate. Upon delivery to the targeted cells, the protease 
activates the prodrug, which then exhibits is therapeutic effect, such as a 
15 cytotoxic effect. 

1 . Conjugation 

Conjugates with linked MTSP protein domains can be prepared either by 
chemical conjugation, recombinant DNA technology, or combinations of 
recombinant expression and chemical conjugation. The MTSP protein domains 
20 and the targeting agent may be linked in any orientation and more than one 
targeting agents and/or MTSP protein domains may be present in a conjugate. 

a. Fusion proteins 

Fusion proteins are proved herein. A fusion protein contains: a) one or a 
plurality of domains of an MTSP proteins and b) a targeting agent. The fusion 
25 proteins are preferably produced by recombinant expression of nucleic acids that 
encode the fusion protein. 

b. Chemical conjugation 

To effect chemical conjugation herein, the MTSP protein domain is linked 
via one or more selected linkers or directly to the targeting agent. Chemical 
30 conjugation must be used if the targeted agent is other than a peptide or protein, 
such a nucleic acid or a non-peptide drug. Any means known to those of skill in 
the art for chemically conjugating selected moieties may be used. 
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2. Linkers 

Linkers for two purposes are contemplated herein. The conjugates may 
include one or more linkers between the MTSP protein portion and the targeting 
agent. Additionally, linkers are used for facilitating or enhancing immobilization 
5 of an MTSP protein or portion thereof on a solid support, such as a microtiter 
plate, silicon or silicon-coated chip, glass or plastic support, such as for high 
throughput solid phase screening protocols. 

Any linker known to those of skill in the art for preparation of conjugates 
may be used herein. These linkers are typically used in the preparation of 

10 chemical conjugates; peptide linkers may be incorporated into fusion proteins. 

Linkers can be any moiety suitable to associate a domain of MTSP protein 
and a targeting agent. Such linkers and linkages include, but are not limited to, 
peptidic linkages, amino acid and peptide linkages, typically containing between 
one and about 60 amino acids, more generally between about 10 and 30 amino 

15 acids, chemical linkers, such as heterobifunctional cleavable cross-linkers, 

including but are not limited to, N-succinimidyl (4-iodoacetyl)-aminobenzoate f 
sulfosuccinimydil (4-iodoacetyD-aminobenzoate, 4-succinimidyl-oxycarbonyl-a- 
{2-pyridyldithio)toluene, sulfosuccinimidyl-6- [a-methyl-a-(pyridyldithiol)- 
toluamido] hexanoate, N-succinimidyl-3-{-2-pyridyldithio) - propionate, 

20 succinimidyl 6[3(-{-2-pyridyldithio)-proprionamido] hexanoate, sulfosuccinimidyl 
6[3H-2-pyridyldithio}-propionamido] hexanoate, 3-{2-pyridyldithio)-propionyl 
hydrazide, Ellman's reagent, dichlorotriazinic acid, and S-{2-thiopyridyl)-L- 
cysteine. Other linkers include, but are not limited to peptides and other 
moieties that reduce stearic hindrance between the domain of MTSP protein and 

25 the targeting agent, intracellular enzyme substrates, linkers that increase the 
flexibility of the conjugate, linkers that increase the solubility of the conjugate, 
linkers that increase the serum stability of the conjugate, photocleavable linkers 
and acid cleavable linkers. 



RECTIFIED SHEET (RULE 91) 



WO 01/57194 

PCT/USOl/03471 



-114- 



10 



Other exemplary linkers and linkages that are suitable for chemically 
linked conjugates include, but are not limited to, disulfide bonds, thioether 
bonds, hindered disulfide bonds, and covalent bonds between free reactive 
groups, such as amine and thiol groups. These bonds are produced using 
heterobifunctional reagents to produce reactive thiol groups on one or both of 
the polypeptides and then reacting the thiol groups on one polypeptide with 
reactive thiol groups or amine groups to which reactive maleimido groups or thiol 
groups can be attached on the other. Other linkers include, acid cleavable 
linkers, such as bismaleimideothoxy propane, acid labile-transferrin conjugates 
and adipic acid dihydrazide, that would be cleaved in more acidic intracellular 
compartments; cross linkers that are cleaved upon exposure to UV or visible 
light and linkers, such as the various domains, such as C H 1. C„2, and C H 3, from 
the constant region of human IgG, (see, Batra era/. Molecular Immunol., 
30:379-386 (1993)). In some embodiments, several linkers may be included in 
15 order to take advantage of desired properties of each linker. 

Chemical linkers and peptide linkers may be inserted by covalently 
coupling the linker to the domain of MTSP protein and the targeting agent. The 
heterobifunctional agents, described below, may be used to effect such covalent 
coupling. Peptide linkers may also be linked by expressing DNA encoding the 
linker and TA, linker and targeted agent, or linker, targeted agent and TA as a 
fusion protein. Flexible linkers and linkers that increase solubility of the 
conjugates are contemplated for use, either alone or with other linkers are also 
contemplated herein. 

a) Acid cleavable, photocleavable and heat sensitive linkers 
25 Acid cleavable linkers, photocleavable and heat sensitive linkers may also 

be used, particularly where it may be necessary to cleave the domain of MTSP 
protein to permit it to be more readily accessible to reaction. Acid cleavable 
linkers include, but are not limited to, bismaleimideothoxy propane; and adipic 
acid dihydrazide linkers (see, e.g., Fattom era/. (1992) Infection & Immun. 
30 50:584-589) and acid labile transferrin conjugates that contain a sufficient 
portion of transferrin to permit entry into the intracellular transferrin cycling 
pathway (see, e.g., Welh8ner etat. (1991)./ Biol. Chem. 255:4309-4314). 



20 
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Photocleavable linkers are linkers that are cleaved upon exposure to light 
(see, e.g., Goldmacher et al. (1992) Bioconj. Chem. 3:104-107, which linkers 
are herein incorporated by reference), thereby releasing the targeted agent upon 
exposure to light. Photocleavable linkers that are cleaved upon exposure to light 
5 are known (see, e.g., Hazum era/. (1981) in Pept., Proc. Eur. Pept. Symp., 

16th, Brunfeldt, K (Ed), pp. 105-110, which describes the use of a nitrobenzyl 
group as a photocleavable protective group for cysteine; Yen eta/. (1989) 
Makromoi. Chem 750:69-82, which describes water soluble photocleavable 
copolymers, including hydroxypropylmethacrylamide copolymer, glycine 

10 copolymer, fluorescein copolymer and methylrhodamine copolymer; Gold- 
macher et al. (1992) Bioconj. Chem. 3:104-107, which describes a cross-linker 
and reagent that undergoes photolytic degradation upon exposure to near UV 
light (350 nm); and Senter et al. (1985) Photochem. Photobiol 42:231-237, 
which describes nitrobenzyloxycarbonyl chloride cross linking reagents that 

15 produce photocleavable linkages), thereby releasing the targeted agent upon 
exposure to light. Such linkers would have particular use in treating 
dermatological or ophthalmic conditions that can be exposed to light using fiber 
optics. After administration of the conjugate, the eye or skin or other body part 
can be exposed to light, resulting in release of the targeted moiety from the 

20 conjugate. Such photocleavable linkers are useful in connection with diagnostic 
protocols in which it is desirable to remove the targeting agent to permit rapid 
clearance from the body of the animal. 

b) Other linkers for chemical conjugation 
Other linkers, include trityl linkers, particularly, derivatized 

25 trityl groups to generate a genus of conjugates that provide for 

release of therapeutic agents at various degrees of acidity or alkalinity. 

The flexibility thus afforded by the ability to preselect the pH range at 

which the therapeutic agent will be released allows selection of a linker based on 

the known physiological differences between tissues in need of delivery of a 

30 therapeutic agent (see, e.g., U.S. Patent No. 5,612,474). For example, the 
acidity of tumor tissues appears to be lower than that of normal tissues. 
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c) Peptide linkers 
The linker moieties can be peptides. Peptide linkers can be employed in 
fusion proteins and also in chemically linked conjugates. The peptide typically 
has from about 2 to about 60 amino acid residues, for example from about 5 to 
5 about 40, or from about 10 to about 30 amino acid residues. The length 
selected will depend upon factors, such as the use for which the linker is 
included. 

Peptide linkers are advantageous when. the targeting agent is 
proteinaceous. For example, the linker moiety can be a flexible spacer amino 

10 acid sequence, such as those known in single-chain antibody research. 
Examples of such known linker moieties include, but are not limited to, 
peptides, such as (Gly m Ser) n and (Ser m Gly) n , in which n is 1 to 6, preferably 1 to 
4, more preferably 2 to 4, and m is 1 to 6, preferably 1 to 4, more preferably 2 
to 4, enzyme cleavable linkers and others. 

15 Additional linking moieties are described, for example, in Huston et al., 

Proc. Natl. Acad. Sci. U.S.A. 55:5879-5883, 1988; Whitlow, M., et al., Protein 
Engineering 5:989-995, 1993; Newton etaL, Biochemistry 35:545-553, 1 996; 
A. J. Cumber et al, Bioconj. Chem. 3:397-401, 1992; Ladurner et al., J. Mol. 
Bio!. 273:330-337, 1997; and U.S. Patent. No. 4,894,443. In some 

20 embodiments, several linkers may be included in order to take advantage of 
desired properties of each linker. 

3. Targeting agents 

Any agent that facilitates detection, immobilization, or purification of the 
conjugate is contemplated for use herein. For chemical conjugates any moiety 

25 that has such properties is contemplated; for fusion proteins, the targeting agent 
is a protein, peptide or fragment thereof that sufficient to effects the targeting 
activity. Preferred targeting agents are those that deliver the MTSP protein or 
portion thereof to selected cells and tissues. Such agents include tumor specific 
monoclonal antibodies and portions thereof, growth factors, such as FGF, EGF, 

30 PDGF, VEGF, cytokines, including chemokines, and other such agents. 

4. Nucleic acids, plasmids and cells 
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Isolated nucleic acid fragments encoding fusion proteins are provided. 
The nucleic acid fragment that encodes the fusion protein includes: a) nucleic 
acid encoding a protease domain of an MTSP protein encoded by a nucleic acid 
that hybridizes to a nucleic acid having the nucleotide sequence set forth in the 
5 SEQ. ID NO:1; and b) nucleic acid encoding a protein, peptide or effective 

fragment thereof that facilitates: i) affinity isolation or purification of the fusion 
protein; ii) attachment of the fusion protein to a surface; or iii) detection of the 
fusion protein. Preferably, the nucleic acid is DNA.. 

Plasmids for replication and vectors for expression that contain the above 

10 nucleic acid fragments are also provided. Cells containing the plasmids and 

vectors are also provided. The cells can be any suitable host including, but are 
not limited to, bacterial cells, yeast cells, fungal cells, plant cells, insect cell and 
animal cells. The nucleic acids, plasmids, and cells containing the plasmids can 
be prepared according to methods known in the art including any described 

15 herein. 

Also provided are methods for producing the above fusion proteins. An 
exemplary method includes the steps of growing, i.e. culturing the cells so that 
the proliferate, cells containing a plasmid encoding the fusion protein under 
conditions whereby the fusion protein is expressed by the cell, and recovering 
20 the expressed fusion protein. Methods for expressing and recovering 

recombinant proteins are well known in the art {See generally. Current Protocols 
in Molecular Biology (1998) § 16, John Wiley & Sons, Inc.) and such methods 
can be used for expressing and recovering the expressed fusion proteins. 
Preferably, the recombinant expression and recovery methods disclosed in 

25 Section B can be used. 

The recovered fusion proteins can be isolated or purified by methods 
known in the art such as centrifugation, filtration, chromatograph, 
electrophoresis, immunoprecipitation, etc., or by a combination thereof {See 
generally. Current Protocols in Molecular Biology (1998) § 10, John Wiley & 

30 Sons, Inc.). Preferably, the recovered fusion protein is isolated or purified 

through affinity binding between the protein or peptide fragment of the fusion 
protein and an affinity binding moiety. As discussed in the above sections 
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regarding the construction of the fusion proteins, any affinity binding pairs can 
be constructed and used in the isolation or purification of the fusion proteins. 
For example, the affinity binding pairs can be protein binding sequences/protein, 
DNA binding sequences/DNA sequences, RNA binding sequences/RNA 
5 sequences, lipid binding sequences/lipid, polysaccharide binding 
sequences/polysaccharide, or metal binding sequences/metal. 

5. Immobilization and supports or substrates therefor 
In certain embodiments, where the targeting agents are designed for 
linkage to surfaces, the MTSP protein can be attached by linkage such as ionic 
10 or covalent, non-covalent or other chemical interaction, to a surface of a support 
or matrix material. Immobilization may be effected directly or via a linker. The 
MTSP protein may be immobilized on any suitable support, including, but are not 
limited to, silicon chips, and other supports described herein and known to those 
of skill in the art. A plurality of MTSP protein or protease domains thereof may 
15 be attached to a support, such as an array (/.e., a pattern of two or more) of 
conjugates on the surface of a silicon chip or other chip for use in high 
throughput protocols and formats. 

It is also noted that the domains of the MTSP protein can be linked 
directly to the surface or via a linker without a targeting agent linked thereto. 
20 Hence chips containing arrays of the domains of the MTSP protein. 

The matrix material or solid supports contemplated herein are generally 
any of the insoluble materials known to those of skill in the art to immobilize 
ligands and other molecules, and are those that used in many chemical 
syntheses and separations. Such supports are used, for example, in affinity 
25 chromatography, in the immobilization of biologically active materials, and during 
chemical syntheses of biomolecules, including proteins, amino acids and other 
organic molecules and polymers. The preparation of and use of supports is well 
known to those of skill in this art; there are many such materials and 
preparations thereof known. For example, naturally-occurring support materials, 
30 such as agarose and cellulose, may be isolated from their respective sources, 
and processed according to known protocols, and synthetic materials may be 
prepared in accord with known protocols. 
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The supports are typically insoluble materials that are solid, porous, 
deformable, or hard, and have any required structure and geometry, including, 
but not limited to: beads, pellets, disks, capillaries, hollow fibers, needles, solid 
fibers, random shapes, thin films and membranes. Thus, the item may be 
5 fabricated from the matrix material or combined with it, such as by coating all or 
part of the surface or impregnating particles. 

Typically, when the matrix is particulate, the particles are at least about 
10-2000 /yM, but may be smaller or larger, depending upon the selected 
application. Selection of the matrices will be governed, at least in part, by their 

10 physical and chemical properties, such as solubility, functional groups, 

mechanical stability, surface area swelling propensity, hydrophobic or hydrophilic 
properties and intended use. 

If necessary, the support matrix material can be treated to contain an 
appropriate reactive moiety. In some cases, the support matrix material already 

15 containing the reactive moiety may be obtained commercially. The support 

matrix material containing the reactive moiety may thereby serve as the matrix 
support upon which molecules are linked. Materials containing reactive surface 
moieties such as amino silane linkages, hydroxyl linkages or carboxysiiane 
linkages may be produced by well established surface chemistry techniques 

20 involving silanization reactions, or the like. Examples of these materials are 

those having surface silicon oxide moieties, covalently linked to gamma-amino- 
propylsilane, and other organic moieties; N-[3-(triethyoxysilyl)propyl]phthelamic 
acid; and bis-(2-hydroxyethyl)aminopropyltriethoxysilane. Exemplary of readily 
available materials containing amino group reactive functionalities, include, but 

25 are not limited to, para-aminophenyltriethyoxysilane. Also derivatized 

polystyrenes and other such polymers are well known and readily available to 
those of skill in this art (e.g., the Tentagel® Resins are available with a multitude 
of functional groups, and are sold by Rapp Polymere, Tubingen, Germany; see, 
U.S. Patent No. 4,908,405 and U.S. Patent No. 5,292,814; see, also Butz et al., 

30 Peptide Res., 7:20-23 (1994); and Kleine et al., ImmunobioL, 190:53-66 
(1994)). 



o 
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These matrix materials include any material that can act as a support 
matrix for attachment of the molecules of interest. Such materials are known to 
those of skill in this art, and include those that are used as a support matrix. 
These materials include, but are not limited to, inorganics, natural polymers, and 
5 synthetic polymers; including, but are not limited to: cellulose, cellulose 
derivatives, acrylic resins, glass, silica gels, polystyrene, gelatin, polyvinyl 
pyrrolidone, co-polymers of vinyl and acrylamide, polystyrene cross-linked with 
divinylbenzene and others {see, Merrifield, Biochemistry, 3:1385-1390 (1964)), 
polyacrylamides, latex gels, polystyrene, dextran, polyacrylamides, rubber, 

10 silicon, plastics, nitrocellulose, celluloses, natural sponges. Of particular interest 
herein, are highly porous glasses (see, e.g., U.S. Patent No. 4,244,721) and 
others prepared by mixing a borosilicate, alcohol and water. 

Synthetic supports include, but are not limited to: acrylamides, dextran- 
derivatives and dextran co-polymers, agarose-polyacrylamide blends, other 

15 polymers and co-polymers with various functional groups, methacrylate 

derivatives and corpolymers, polystyrene and polystyrene copolymers {see, e.g., 
Merrifield, Biochemistry, 3:1385-1390 (1964); Berg et al., in innovation 
Perspect. Solid Phase Synth. Coiiect. Pap., Int. Symp., 1st, Epton, Roger (Ed), 
pp. 453-459 (1990); Berg et al., Pept., Proc. Eur. Pept. Symp., 20th, Jung, G. 

20 et aL (Eds), pp. 196-198 (1989); Berg et al., J. Am. Chem. Soc, 

1 1 1 :8024-8026 (1989); Kent et al., /sr. J. Chem., 17:243-247 (1979); Kent et 
al., J. Org. Chem., 43:2845-2852 (1978); Mitchell et al.. Tetrahedron Lett., 
42:3795-3798 (1976); U.S. Patent No. 4,507,230; U.S. Patent No. 4,006,1 17; 
and U.S. Patent No. 5,389,449). Such materials include those made from 

25 polymers and co-polymers such as polyvinylaicohols, acrylates and acrylic acids 
such as polyethylene-co-acrylic acid, polyethylene-co-methacrylic acid, polyethy- 
lene-co-ethylacrylate, polyethylene-co-methyl acrylate, polypropylene-co-acrylic 
acid, poiypropylene-co-methyl-acrylic acid, polypropylene-co-ethylacrylate, 
polypropylene-co-methyl acrylate, polyethylene-co-vinyl acetate, poly- 

30 propylene-co-vinyl acetate, and those containing acid anhydride groups such as 
polyethylene-co-maleic anhydride and polypropylene-co-maleic anhydride. 
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Liposomes have also been used as solid supports for affinity purifications (Powell 

et al. Biotechnol. Bioeng., 33:173 (1989)). 

Numerous methods have been developed for the immobilization of 

proteins and other biomolecules onto solid or liquid supports (see, e.g., 
5 Mosbach, Methods in Enzymotogy, 44 (1976); Weetall, Immobilized Enzymes, 

Antigens, Antibodies, and Peptides, (1975); Kennedy et al., Solid Phase 

Biochemistry, Analytical and Synthetic Aspects, Scouten, ed., pp. 253-391 

(1983); see, generally. Affinity Techniques. Enzyme Purification: Part B. 

Methods in Enzymology, Vol. 34, ed. W. B. Jakoby, M. Wilchek, Acad. Press, 
10 N.Y. (1974); and Immobilized Biochemicals and Affinity Chromatography, 

Advances in Experimental Medicine and Biology, vol. 42, ed. R. Dunlap, Plenum 

Press, N.Y. (1974)). 

Among the most commonly used methods are absorption and adsorption 

or covalent binding to the support, either directly or via a linker, such as the 
15 numerous disulfide linkages, thioether bonds, hindered disulfide bonds, and 

covalent bonds between free reactive groups, such as amine and thiol groups, 

known to those of skill in art (see, e.g., the PIERCE CATALOG, 

ImmunoTechnoJogy Catalog & Handbook, 1 992-1 993, which describes the 

preparation of and use of such reagents and provides a commercial source for 
20 such reagents; Wong, Chemistry of Protein Conjugation and Cross Unking, CRC 

Press (1993); see also DeWitt et al., Proc. Natl. Acad. Sci. U.S.A., 90:6909 

(1993); Zuckermann et al., J. Am. Chem. Soc, 1 14 :10646 (1992); Kurth et al., 

J. Am. Chem. Soc, 1 16 :2661 (1994); Ellman et al., Proc. Natl. Acad. Sci. 

U.S.A., 91:4708 (1994); Sucholeiki, Tetrahedron Lttrs., 35:7307 (1994); Su- 
25 Sun Wang, J. Org. Chem., 41:3258 (1976); Padwa et al., J. Org. Chem., 

41:3550 (1971); and Vedejs et al., J. Org. Chem., 49:575 (1984), which 

describe photosensitive linkers). 

To effect immobilization, a composition containing the protein or other 

biomolecule is contacted with a support material such as alumina, carbon, an 
30 ion-exchange resin, cellulose, glass or a ceramic. Fluorocarbon polymers have 

been used as supports to which biomolecules have been attached by adsorption 
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(see, U.S. Patent No. 3,843,443; Published International PCT Application 

WO/86 03840). 

J. Prognosis and diagnosis 

MTSP protein proteins, domains, analogs, and derivatives thereof, and 
5 encoding nucleic acids (and sequences complementary thereto), and anti-MTSP 
protein antibodies, can be used in diagnostics. Such molecules can be used in 
assays, such as immunoassays, to detect, prognose, diagnose, or monitor 
various conditions, diseases, and disorders affecting MTSP protein expression, or 
monitor the treatment thereof. For purposes herein, the presence of MTSPs in 

10 body fluids or tumor tissues are of particular interest. 

In particular, such an immunoassay is carried out by a method including 
contacting a sample derived from a patient with an anti-MTSP protein antibody 
under conditions such that specific binding can occur, and detecting or 
measuring the amount of any specific binding by the antibody. In a specific 

15 aspect, such binding of antibody, in tissue sections, can be used to detect 
.aberrant MTSP protein localization or aberrant {e.g., low or absent) levels of 
MTSP protein. In a specific embodiment, antibody to MTSP protein can be used 
to assay in a patient tissue or serum sample for the presence of MTSP protein 
where an aberrant level of MTSP protein is an indication of a diseased condition. 

20 The immunoassays which can be used include but are not limited to 

competitive and non-competitive assay systems using techniques such as 
western blots, radioimmunoassays, ELISA {enzyme linked immunosorbent 
assay), "sandwich" immunoassays, immunoprecipitation assays, precipitin 
reactions, gel diffusion precipitin reactions, immunodiffusion assays, 

25 agglutination assays, complement-fixation assays, immunoradiometric assays, 
fluorescent immunoassays, protein A immunoassays, to name but a few. 

MTSP protein genes and related nucleic acid sequences and 
subsequences, including complementary sequences, can also be used in 
hybridization assays. MTSP protein nucleic acid sequences, or subsequences 

30 thereof containing about at least 8 nucleotides, preferably 1 4 or 1 6 or 30 or 
more continugous nucleotides, can be used as hybridization probes. 
Hybridization assays can be used to detect, prognose, diagnose, or monitor 
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conditions, disorders, or disease states associated with aberrant changes in 
MTSP protein expression and/or activity as described herein. In particular, such 
a hybridization assay is carried out by a method by contacting a sample 
containing nucleic acid with a nucleic acid probe capable of hybridizing to MTSP 
5 protein encoding DNA or RNA, under conditions such that hybridization can 
occur, and detecting or measuring any resulting hybridization. 

In a specific embodiment, a method of diagnosing a disease or disorder 
characterized by detecting an aberrant level of an MTSP protein in a subject is 
provided herein by measuring the level of the DNA, RNA, protein or functional 
10 activity of the epithelial MTSP protein at least partially encoded by a nucleic 

acid that hybridizes to a nucleic acid having the nucleotide sequence set forth in 
the SEQ. ID NO:1 in a sample derived from the subject, wherein an Increase or 
decrease in the level of the DNA, RNA, protein or functional activity of the MTSP 
protein, relative to the level of the DNA, RNA, protein or functional activity 

15 found in an analogous sample not having the disease or disorder indicates the 
presence of the disease or disorder in the subject. 

Kits for diagnostic use are also provided, that contain in one or more 
containers an anti-MTSP protein antibody, particularly anti-MTSP3 or 
anti = MTSP4, and, optionally, a labeled binding partner to the antibody. 

20 Alternatively, the anti-MTSP protein antibody can be labeled (with a detectable 
marker, e.g., a chemiluminescent, enzymatic, fluorescent, or radioactive moiety). 
A kit is also provided that includes in one or more containers a nucleic acid 
probe capable of hybridizing to MTSP protein-encoding RNA. In a specific 
embodiment, a kit can comprise in one or more containers a pair of primers (e.g., 

25 each in the size range of 6-30 nucleotides) that are capable of priming 

amplification [e.g., by polymerase chain reaction (see e.g., Innis et aL, 1990, 
PCR Protocols, Academic Press, Inc., San Diego, CA), ligase chain reaction (see 
EP 320,308) use of Qfi replicase, cyclic probe reaction, or other methods known 
in the art under appropriate reaction conditions of at least a portion of an MTSP 

30 protein-encoding nucleic acid. A kit can optionally further comprise in a 

container a predetermined amount of a purified MTSP protein or nucleic acid, 
e.g., for use as a standard or control. 
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K. PHARMACEUTICAL COMPOSITIONS AND MODES OF ADMINISTRATION 
1 . Components of the compositions 

Pharmaceutical compositions containing the identified compounds that 
modulate the activity of an MTSP protein are provided herein. Also provided are 
5 combinations of a compound that modulates the activity of an MTSP protein and 
another treatment or compound for treatment of a neoplastic disorder, such as a 
chemotherapeutic compound. 

The MTSP protein modulator and the anti-tumor agent can be packaged 
as separate compositions for administration together or sequentially or 
10 intermittently. Alternatively, they can provided as 

a single composition for administration or as two compositions for administration 
as a single composition. The combinations can be packaged as kits. 

a. MTSP protein inhibitors 
Any MTSP protein inhibitors, including those described herein when used 
15 alone or in combination with other compounds, that can alleviate, reduce, 
ameliorate, prevent, or place or maintain in a state of remission of clinical 
symptoms or diagnostic markers associated with neoplastic diseases, including 
undesired and/or uncontrolled angiogenesis, can be used in the present 
combinations. 

20 In one embodiment, the MTSP protein inhibitor is an antibody or fragment 

thereof that specifically reacts with an MTSP protein or the protease domain 
thereof, an inhibitor of the MTSP protein production, an inhibitor of the epithelial 
MTSP protein membrane-localization, or any inhibitor of the expression of or, 
especially, the activity of an MTSP protein. 

25 b. Anti-angiogenic agents and anti-tumor agents 

Any anti-angiogenic agents and anti-tumor agents, including those 
described herein, when used alone or in combination with other compounds, that 
can alleviate, reduce, ameliorate, prevent, or place or maintain in a state of 
remission of clinical symptoms or diagnostic markers associated with undesired 

30 and/or uncontrolled angiogenesis and/or tumor growth and metastasis, 
particularly solid neoplasms, vascular malformations and cardiovascular 
disorders, chronic inflammatory diseases and aberrant wound repairs, circulatory 



WO 01/57194 



PCT/US01/03471 



-125- 

disorders, crest syndromes, dermatological disorders, or ocular disorders, can be 
used in the combinations. Also contemplated are anti-tumor agents for use in 
combination with an inhibitor of an MTSP protein. 

c. Anti-tumor agents and anti-angiogenic agents 
5 The compounds identified by the methods provided herein or provided 

herein can be used in combination with anti-tumor agents and/or anti- 
angiogenesis agents. 

2. Formulations and route of administration 

The compounds herein and agents are preferably formulated as 

10 pharmaceutical compositions, preferably for single dosage administration. The 
concentrations of the compounds in the formulations are effective for delivery of 
an amount, upon administration, that is effective for the intended treatment. 
Typically, the compositions are formulated for single dosage administration. To 
formulate a composition, the weight fraction of a compound or mixture thereof is 

15 dissolved, suspended, dispersed or otherwise mixed in a selected vehicle at an 
effective concentration such that the treated condition is relieved or ameliorated. 
Pharmaceutical carriers or vehicles suitable for administration of the compounds 
provided herein include any such carriers known to those skilled in the art to be 
suitable for the particular mode of administration. 

20 In addition, the compounds may be formulated as the sole 

pharmaceutical^ active ingredient in the composition or may be combined with 
other active ingredients. Liposomal suspensions, including tissue-targeted 
liposomes, may also be suitable as pharmaceutical^ acceptable carriers. These 
may be prepared according to methods known to those skilled in the art. For 

25 example, liposome formulations may be prepared as described in U.S. Patent No, 
4,522,81 1. 

The active compound is included in the pharmaceutical^ acceptable 
carrier in an amount sufficient to exert a therapeutically useful effect in the 
absence of undesirable side effects on the patient treated. The therapeutically 
30 effective concentration may be determined empirically by testing the compounds 
in known ]n vitro and in vivo systems, such as the assays provided herein. 
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The concentration of active compound in the drug composition will 
depend on absorption, inactivation and excretion rates of the active compound, 
the physicochemical characteristics of the compound, the dosage schedule, and 
amount administered as well as other factors known to those of skill in the art. 
5 Typically a therapeutically effective dosage is contemplated. The 

amounts administered may be on the order of 0.001 to 1 mg/ml, preferably 
about 0.005-0.05 mg/ml, more preferably about 0.01 mg/ml, of blood volume. 
Pharmaceutical dosage unit forms are prepared to provide from about 1 mg to 
about 1000 mg and preferably from about 10 to about 500 mg, more preferably 
10 about 25-75 mg of the essential active ingredient or a combination of essential 
ingredients per dosage unit form. The precise dosage can be empirically 
determined. 

The active ingredient may be administered at once, or may be divided into 
a number of smaller doses to be administered at intervals of time. It is 

15 understood that the precise dosage and duration of treatment is a function of the 
disease being treated and may be determined empirically using known testing 
protocols or by extrapolation from in vivo or in vitro test data. It is to be noted 
that concentrations and dosage values may also vary with the severity of the 
condition to be alleviated. It is to be further understood that for any particular 

20 subject, specific dosage regimens should be adjusted over time according to the 
individual need and the professional judgment of the person administering or 
supervising the administration of the compositions, and that the concentration 
ranges set forth herein are exemplary only and are not intended to limit the 
scope or use of the claimed compositions and combinations containing them. 

25 Preferred pharmaceutical^ acceptable derivatives include acids, salts, 

esters, hydrates, solvates and prodrug forms. The derivative is typically selected 
such that its pharmacokinetic properties are superior to the corresponding 
neutral compound. 
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Thus, effective concentrations or amounts of one or more of the 
compounds provided herein or pharmaceutically acceptable derivatives thereof 
are mixed with a suitable pharmaceutical carrier or vehicle for systemic, topical 
or local administration to form pharmaceutical compositions. Compounds are 
5 included in an amount effective for ameliorating or treating the disorder for 

which treatment is contemplated. The concentration of active compound in the 
composition will depend on absorption, inactivation, excretion rates of the active 
compound, the dosage schedule, amount administered, particular formulation as 
well as other factors known to those of skill in the art. 

10 Solutions or suspensions used for parenteral, intradermal, subcutaneous, 

or topical application can include any of the following components: a sterile 
diluent, such as water for injection, saline solution, fixed oil, polyethylene glycol, 
glycerine, propylene glycol or other synthetic solvent; antimicrobial agents, such 
as benzyl alcohol and methyl parabens; antioxidants, such as ascorbic acid and 

15 sodium bisulfite; chelating agents, such as ethylenediaminetetraacetic acid 

(EDTA); buffers, such as acetates, citrates and phosphates; and agents for the 
adjustment of tonicity such as sodium chloride or dextrose. Parenteral 
preparations can be enclosed in ampules, disposable syringes or single or 
multiple dose viafs made of glass, plastic or other suitable material. 

20 In instances in which the compounds exhibit insufficient solubility, 

methods for solubilizing compounds may be used. Such methods are known to 
those of skill in this art, and include, but are not limited to, using cosolvents, 
such as dimethylsulfoxide (DMSO), using surfactants, such as Tween®, or 
dissolution in aqueous sodium bicarbonate. Derivatives of the compounds, such 

25 as prodrugs of the compounds may also be used in formulating effective 

pharmaceutical compositions. For ophthalmic indications, the compositions are 
formulated in an ophthalmically acceptable carrier. For the ophthalmic uses 
herein, local administration, either by topical administration or by injection is 
preferred. Time release formulations are also desirable. Typically, the 

30 compositions are formulated for single dosage administration, so that a single 
dose administers an effective amount. 
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Upon mixing or addition of the compound with the vehicle, the resulting 
mixture may be a solution, suspension, emulsion or other composition. The form 
of the resulting mixture depends upon a number of factors, including the 
intended mode of administration and the solubility of the compound in the 
5 selected carrier or vehicle. If necessary, pharmaceutical^ acceptable salts or 
other derivatives of the compounds are prepared. 

The compound is included in the pharmaceutical^ acceptable carrier in an 
amount sufficient to exert a therapeutically useful effect in the absence of 
undesirable side effects on the patient treated. It is understood that number and 

10 degree of side effects depends upon the condition for which the compounds are 
administered. For example, certain toxic and undesirable side effects are 
tolerated when treating life-threatening illnesses that would not be tolerated 
when treating disorders of lesser consequence. 

The compounds can also be mixed with other active materials, that do 

15 not impair the desired action, or with materials that supplement the desired 
action known to those of skill in the art. The formulations of the compounds 
and agents for use herein include those suitable for oral, rectal, topical, 
inhalational, buccal [e.g., sublingual), parenteral {e.g., subcutaneous, 
intramuscular, intradermal, or intravenous), transdermal administration or any 

20 route. The most suitable route in any given case will depend on the nature and 
severity of the condition being treated and on the nature of the particular active 
compound which is being used. The formulations are provided for administration 
to humans and animals in unit dosage forms, such as tablets, capsules, pills, 
powders, granules, sterile parenteral solutions or suspensions, and oral solutions 

25 or suspensions, and oil-water emulsions containing suitable quantities of the 
compounds or pharmaceutical^ acceptable derivatives thereof. The 
pharmaceutical^ therapeutically active compounds and derivatives thereof are 
typically formulated and administered in unit-dosage forms or multiple-dosage 
forms. Unit-dose forms as used herein refers to physically discrete units suitable 

30 for human and animal subjects and packaged individually as is known in the art. 
Each unit-dose contains a predetermined quantity of the therapeutically active 
compound sufficient to produce the desired therapeutic effect, in association 
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with the required pharmaceutical carrier, vehicle or diluent. Examples of 
unit-dose forms include ampoules and syringes and individually packaged tablets 
or capsules. Unit-dose forms may be administered in fractions or multiples 
thereof. A multiple-dose form is a plurality of identical unit-dosage forms 
5 packaged in a single container to be administered in segregated unit-dose form. 
Examples of multiple-dose forms include vials, bottles of tablets or capsules or 
bottles of pints or gallons. Hence, multiple dose form is a multiple of unit-doses 
which are not segregated in packaging. 

The composition can contain along with the active ingredient: a diluent 

10 such as lactose, sucrose, dicalcium phosphate, or carboxymethylcellulose; a 
lubricant, such as magnesium stearate, calcium stearate and talc; and a binder 
such as starch, natural gums, such as gum acaciagelatin, glucose, molasses, 
polvinylpyrrolidine, celluloses and derivatives thereof, povidone, crospovidones 
and other such binders known to those of skill in the art. Liquid 

15 pharmaceutical^ administrable compositions can, for example, be prepared by 
dissolving, dispersing, or otherwise mixing an active compound as defined above 
and optional pharmaceutical adjuvants in a carrier, such as, for example, water, 
saline, aqueous dextrose, glycerol, glycols, ethanol, and the like, to thereby form 
a solution or suspension. If desired, the pharmaceutical composition to be 

20 administered may also contain minor amounts of nontoxic auxiliary substances 
such as wetting agents, emulsifying agents, or solubilizing agents, pH buffering 
agents and the like, for example, acetate, sodium citrate, cyclodextrine 
derivatives, sorbitan monolaurate, triethanolamine sodium acetate, 
triethanolamine oleate, and other such agents. Methods of preparing such 

25 dosage forms are known, or will be apparent, to those skilled in this art (see, 
e.g., Remington's Pharmaceutical Sciences, Mack Publishing Company, Easton, 
Pa., 15th Edition, 1975). The composition or formulation to be administered will 
contain a quantity of the active compound in an amount sufficient to alleviate 
the symptoms of the treated subject. 

30 Dosage forms or compositions containing active ingredient in the range of 

0.005% to 100% with the balance made up from non-toxic carrier may be 
prepared.. For oral administration, the pharmaceutical compositions may take the 
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form of, for example, tablets or capsules prepared by conventional means with 
pharmaceutically acceptable excipients such as binding agents [e.g., 
pregelatinized maize starch, polyvinyl pyrrolidone or hydroxypropyl 
methylcellulose); fillers (e.g., lactose, microcrystalline cellulose or calcium 
5 hydrogen phosphate); lubricants (e.g., magnesium stearate, talc or silica); 

disintegrants {e.g., potato starch or sodium starch glycolate); or wetting agents 
(e.g., sodium lauryl sulphate). The tablets may be coated by methods well- 
known in the art. 

The pharmaceutical preparation may also be in liquid form, for example, 
10 solutions, syrups or suspensions, or may be presented as a drug product for 
reconstitution with water or other suitable vehicle before use. Such liquid 
preparations may be prepared by conventional means with pharmaceutically 
acceptable additives such as suspending agents (e.g., sorbitol syrup, cellulose 
derivatives or hydrogenated edible fats); emulsifying agents (e.g., lecithin or 
15 acacia); non-aqueous vehicles (e.g., almond oil, oily esters, or fractionated 

vegetable oils); and preservatives (e.g., methyl or propyl-p-hydroxybenzoates or 
sorbic acid). 

Formulations suitable for rectal administration are preferably presented as 
unit dose suppositories. These may be prepared by admixing the active 

20 compound with one or more conventional solid carriers, for example, cocoa 
butter, and then shaping the resulting mixture. 

Formulations suitable for topical application to the skin or to the eye 
preferably take the form of an ointment, cream, lotion, paste, gel, spray, aerosol 
and oil. Carriers which may be used include vaseline, lanoline, polyethylene 

25 glycols, alcohols, and combinations of two or more thereof. The topical 

formulations may further advantageously contain 0.05 to 1 5 percent by weight 
of thickeners selected from among hydroxypropyl methyl cellulose, methyl 
cellulose, polyvinylpyrrolidone, polyvinyl alcohol, poly (alkylene glycols), 
poly/hydroxyalkyl, (meth)acrylates or poiy(meth)acrylamides. A topical 

30 formulation is often applied by instillation or as an ointment into the conjunctival 
sac. It can also be used for irrigation or lubrication of the eye, facial sinuses, 
and external auditory meatus. It may also be injected into the anterior eye 



WO 01/57194 



PCT7US01/03471 



-131- 

charnber and other places. The topical formulations in the liquid state may be 
also present in a hydrophilic three-dimensional polymer matrix in the form of a 
strip, contact lens, and the like from which the active components are released. 
For administration by inhalation, the compounds for use herein can be 
5 delivered in the form of an aerosol spray presentation from pressurized packs or 
a nebulizer, with the use of a suitable propellant, e.g., dichlorodifluoromethane, 
trichlorofluoromethane, dichlorotetrafluoroethane, carbon dioxide or other 
suitable gas. In the case of a pressurized aerosol, the dosage unit may be 
determined by providing a valve to deliver a metered amount. Capsules and 
10 cartridges of, e.g., gelatin, for use in an inhaler or insufflator may be formulated 
containing a powder mix of the compound and a suitable powder base such as 
lactose or starch. 

Formulations suitable for buccal (sublingual) administration include, for 
example, lozenges containing the active compound in a flavored base, usually 

15 sucrose and acacia or tragacanth; and pastilles containing the compound in an 
inert base such as gelatin and glycerin or sucrose and acacia. 

The compounds may be formulated for parenteral administration by 
injection, e.g., by bolus injection or continuous infusion. Formulations for 
injection may be presented in unit dosage form, e.g., in ampules or in multi-dose 

20 containers, with an added preservative. The compositions may be suspensions, 
solutions or emulsions in oily or aqueous vehicles, and may contain formulatory 
agents such as suspending, stabilizing and/or dispersing agents. Alternatively, 
the active ingredient may be in powder form for reconstitution with a suitable 
vehicle, e.g., sterile pyrogen-free water or other solvents, before use. 

25 Formulations suitable for transdermal administration may be presented as 

discrete patches adapted to remain in intimate contact with the epidermis of the 
recipient for a prolonged period of time. Such patches suitably contain the 
active compound as an optionally buffered aqueous solution of, for example, 0.1 
to 0.2 M concentration with respect to the active compound. Formulations 

30 suitable for transdermal administration may also be delivered by iontophoresis 
[see, e.g., Pharmaceutical Research 3 (6), 318 (1986)) and typically take the 
form of an optionally buffered aqueous solution of the active compound. 
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The pharmaceutical compositions may also be administered by controlled 
release means and/or delivery devices (see, e.g., in U.S. Patent Nos. 3,536,809; 
3,598,123; 3,630,200; 3,845,770; 3,847,770; 3,91 6,899; 4,008,71 9; 
4,687,610; 4,769,027; 5,059,595; 5,073,543; 5,120,548; 5,354,566; 
5 5,591,767; 5,639,476; 5,674,533 and 5,733,566). 

Desirable blood levels may be maintained by a continuous infusion of the 
active agent as ascertained by plasma levels. It should be noted that the 
attending physician would know how to and when to terminate, interrupt or 
adjust therapy to lower dosage due to toxicity, or bone marrow, liver or kidney 
10 dysfunctions. Conversely, the attending physician would also know how to and 
when to adjust treatment to higher levels if the clinical response is not adequate 
(precluding toxic side effects). 

The efficacy and/or toxicity of the MTSP protein inhibitor(s), alone or in 
combination with other agents can also be assessed by the methods known in 
15 the art (See generally, O'Reilly, Investigational New Drugs, 15:5-1 3 (1 997)). 

The active compounds or pharmaceutical^ acceptable derivatives may be 
prepared with carriers that protect the compound against rapid elimination from 
the body, such as time release formulations or coatings. 

Kits containing the compositions and/or the combinations with 
20 instructions for administration thereof are provided. The kit may further include 
a needle or syringe, preferably packaged in sterile form, for injecting the 
complex, and/or a packaged alcohol pad. Instructions are optionally included for 
administration of the active agent by a clinician or by the patient. 

Finally, the compounds or MTSP proteins or protease domains thereof or 
25 compositions containing any of the preceding agents may be packaged as 

articles of manufacture containing packaging material, a compound or suitable 
derivative thereof provided herein, which is effective for treatment of a diseases 
or disorders contemplated herein, within the packaging material, and a label that 
indicates that the compound or a suitable derivative thereof is for treating the 
30 diseases or disorders contemplated herein. The label can optionally include the 
disorders for which the therapy is warranted. 
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L. METHODS OF TREATMENT 

The compounds identified by the methods herein are used for treating or 
preventing neoplastic diseases in an animal, particularly a mammal, including a 
human, is provided herein. In one embodiment, the method includes 
5 administering to a mammal an effective amount of an inhibitor of an MTSP 
protein, whereby the disease or disorder is treated or prevented. In a preferred 
embodiment, the MTSP protein inhibitor used in the treatment or prevention is 
administered with a pharmaceutical^ acceptable carrier or excipient. The 
mammal treated can be a human. 

10 The inhibitors provided herein are those identified by the screening assays. In 
addition, antibodies and antisense nucleic acids are contemplated. 

The treatment or prevention method can further include administering an 
anti-angiogenic treatment or agent or anti-tumor agent simultaneously with, prior 
to or subsequent to the MTSP protein inhibitor, which can be any compound 

15 identified that inhibits the activity of an MTSP protein, and includes an antibody 
or a fragment or derivative thereof containing the binding region thereof against 
the MTSP protein, an antisense nucleic acid encoding the MTSP protein, and a 
nucleic acid containing at least a portion of a gene encoding the MTSP protein 
into which a heterologous nucleotide sequence has been inserted such that the 

20 heterologous sequence inactivates the biological activity of at least a portion of 
the gene encoding the MTSP protein, in which the portion of the gene encoding 
the MTSP protein flanks the heterologous sequence so as to promote 
homologous recombination with a genomic gene encoding the MTSP protein. 
1 . Antisense treatment 

25 In a specific embodiment, as described hereinabove, MTSP protein 

function is reduced or inhibited by MTSP protein antisense nucleic acids, to treat 
or prevent neoplastic disease. The therapeutic or prophylactic use of nucleic 
acids of at least six nucleotides that are antisense to a gene or cDNA encoding 
MTSP protein or a portion thereof. An MTSP protein "antisense" nucleic acid as 

30 used herein refers to a nucleic acid capable of hybridizing to a portion of an 
MTSP protein RNA (preferably mRNA) by virtue of some sequence 
complementarity. The antisense nucleic acid may be complementary to a coding 
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and/or noncoding region of an MTSP protein mRNA. Such antisense nucleic 
acids have utility as therapeutics that reduce or inhibit MTSP protein function, 
and can be used in the treatment or prevention of disorders as described supra. 
The. MTSP protein antisense nucleic acids are of at least six nucleotides 
5 and are preferably oligonucleotides (ranging from 6 to about 1 50 nucleotides, or 
more preferably 6 to 50 nucleotides). In specific aspects, the oligonucleotide is 
at least 10 nucleotides, at least 15 nucleotides, at least 100 nucleotides, or at 
least 125 nucleotides. The oligonucleotides can be DNA or RIMA or chimeric 
mixtures or derivatives or modified versions thereof, single-stranded or double- 

10 stranded- The oligonucleotide can be modified at the base moiety, sugar moiety, 
or phosphate backbone. The oligonucleotide may include other appending 
groups such as peptides, or agents facilitating transport across the cell 
membrane (see, e.g., Letsinger et al., Proc. Nat!. Acad. Sci. U.S.A. 86:6553- 
6556 (1989); Lemaitre et al., Proc. Natl. Acad. Set. U.S.A. 84:648-652 (1987); 

15 PCT Publication No. WO 88/09810, published December 15, 1988) or blood- 
brain barrier (see, e.g., PCT Publication No. WO 89/10134, published April 25, 
1988), hybridization-triggered cleavage agents (see, e.g., Krol et ai., 
BioTechniques 6:958-976 (1988)) or intercalating agents (see, e.g., Zon, Pharm. 
Res. 5:539-549 (1988)). 

20 The MTSP protein antisense nucleic acid is preferably an oligonucleotide, 

more preferably of single-stranded DNA. In a preferred aspect, the 
oligonucleotide includes a sequence antisense to a portion of human MTSP 
protein. The oligonucleotide may be modified at any position on its structure 
with substituents generally known in the art. 

25 The MTSP protein antisense oligonucleotide may comprise at least one 

modified base moiety which is selected from the group including, but not limited 
to 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, 
xanthine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 
5-carboxymethylaminomethyl-2-thiouridine, 5-carboxymethylaminomethyluracil, 

30 dihydrouracil, beta-D-galactosylqueosine, inosine, N6-isopentenyladenine, 

1- methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2-methyladenine, 

2- methylguanine, 3-methyIcytosine, 5-methylcytosine, N6-adenine, 
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7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl- 
2-thiouracil, beta-D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 
5-methoxyuracil, 2-methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), 
wybutoxosine, pseudouracii, queosine, 2-thiocytosine, 5-methyl-2-thiouraciI, 
5 2-thiouracil, 4-thiouracil, 5-methyluracil, uracil-5-oxyacetic acid methylester, 
uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 3-(3-amino-3-N-2- 
carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. 

In another embodiment, the oligonucleotide includes at least one modified 
sugar moiety selected from the group including but not limited to arabinose, 
10 2-flubroarabinose, xylulose, and hexose. The oligonucleotide can include at least 
one modified phosphate backbone selected from a phosphorothioate, a 
phosphorodithioate, a phosphoramidothioate, a phosphoramidate, a 
phosphordiamidate, a methylphosphonate, an alkyl phosphotriester, and a 
formacetal or analog thereof. 
15 The oligonucleotide can be an a-anomeric oligonucleotide. An a-anomeric 

oligonucleotide forms specific double-stranded hybrids with complementary RNA 
in which the strands run parallel to each other (Gautier et al., NucL Acids Res. 
15:6625-6641 (1987)). 

The oligonucleotide may be conjugated to another molecule, e.g., a 
20 peptide, hybridization triggered cross-linking agent, transport agent and 
hybridization-triggered cleavage agent. 

The oligonucleotides may be synthesized by standard methods known in 
the art, e.g. by use of an automated DNA synthesizer (such as are commercially 
available from Biosearch, Applied Biosystems, etc.). As examples, 
25 phosphorothioate oligonucleotides may be synthesized by the method of Stein et 
al. (NucL Acids Res. 1j6:3209 (1988)), methylphosphonate oligonucleotides can 
be prepared by use of controlled pore glass polymer supports (Sarin et al., Proc. 
Natl. Acad. Sci. U.S.A. 85:7448-7451 (1988)), etc. 

In a specific embodiment, the MTSP protein antisense oligonucleotide 
30 includes catalytic RNA, or a ribozyme (see, e.g., PCT International Publication 
WO 90/1 1364, published October 4, 1990; Sarver et al., Science 247 :1222- 
1225 (1990)). In another embodiment, the oligonucleotide is a 2' : 0- 
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methylribonucleotide (Inoue et a!., Nucl. Acids Res. 1.5:6131-6148 (1987)), or a 
chimeric RNA-DNA analogue (Inoue et al., FEBS Lett. 215:327-330 (1987)). 

In an alternative embodiment, the MTSP protein antisense nucleic acid is 
produced intracellular^ by transcription from an exogenous sequence. For 
5 example, a vector can be introduced in vivo such that it is taken up by a cell, 
within which cell the vector or a portion thereof is transcribed, producing an 
antisense nucleic acid (RNA). Such a vector would contain a sequence encoding 
the MTSP protein antisense nucleic acid. Such a vector can remain episomal or 
become chromosomally integrated, as long as it can be transcribed to produce 

10 the desired antisense RNA. Such vectors can be constructed by recombinant 
DNA technology methods standard in the art. Vectors can be plasmid, viral, or 
others known in the art, used for replication and expression in mammalian cells. 
Expression of the sequence encoding the MTSP protein antisense RNA can be by 
any promoter known in the art to act in mammalian, preferably human, cells. 

15 Such promoters can be inducible or constitutive. Such promoters include but are 
not limited to: the SV40 early promoter region (Bernoist and Chambon, Nature 
290 :304-310 (1981), the promoter contained in the 3' long terminal repeat of 
Rous sarcoma virus (Yamamoto et aL, Ce// 22:787-797 (1980), the herpes 
thymidine kinase promoter (Wagner et al., Proc. Nati. Acad. Sci. U.S.A. 

20 78:1441-1445 (1981), the regulatory sequences of the metallothionein gene 
(Brinster et al.. Nature 296 :39-42 (1982), etc. 

The antisense nucleic acids include sequence complementary to at least a 
portion of an RNA transcript of an MTSP protein gene, preferably a human MTSP 
protein gene. Absolute complementarily, although preferred, is not required. 

25 The amount of MTSP protein antisense nucleic acid that will be effective 

in the treatment or prevention of neoplastic disease will depend on the nature of 
the disease, and can be determined empirically by standard clinical techniques. 
Where possible, it is desirable to determine the antisense cytotoxicity in cells in 
vitro, and then in useful animal model systems prior to testing and use in 

30 humans. 
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2. Gene Therapy 

In an exemplary embodiment, nucleic acids that include a sequence of 
nucleotides encoding an MTSP protein or functional domains or derivative 
thereof, are administered to promote MTSP protein function, by way of gene 
5 therapy. Gene therapy refers to therapy performed by the administration of a 
nucleic acid to a subject. In this embodiment, the nucleic acid produces its 
encoded protein that mediates a therapeutic effect by promoting MTSP protein 
function. Any of the methods for gene therapy available in the art can be used 
(see, Goldspiel et at., Clinical Pharmacy 1_2:488-505 (1993); Wu and Wu, 

10 Biotherapy 3:87-95 (1991); Tolstoshev, An. Rev. Pharmacol. Toxicol. 32:573- 
596 (1993); Mulligan, Science 260 :926-932 (1993); and Morgan and Anderson, 
An. Rev. Biochem. 62:191-217 (1993); TIBTECH 11151:1 55-21 5 (1993). For 
example, one therapeutic composition for gene therapy includes an MTSP 
protein-encoding nucleic acid that is part of an expression vector that expresses 

15 an MTSP protein or domain, fragment or chimeric protein thereof in a suitable 
host. In particular, such a nucleic acid has a promoter operably linked to the 
MTSP protein coding region, the promoter being inducible or constitutive, and, 
optionally, tissue-specific. In another particular embodiment, a nucleic acid 
molecule is used in which the MTSP protein coding sequences and any other 

20 , desired sequences are flanked by regions that promote homologous 
recombination at a desired site in the genome, thus providing for 
intrachromosomal expression of the MTSP protein nucleic acid (Koller and 
Smithies, Proc. Natl. Acad. ScL USA 86:8932-8935 (1989); Zijlstra et al.. 
Nature 342 :435-438 (1989)). 
* 25 Delivery of the nucleic acid into a patient may be either direct, in which 

case the patient is directly exposed to the nucleic acid or nucleic acid-carrying 
vector, or indirect, in which case, cells are first transformed with the nucleic acid 
in vitro, then transplanted into the patient. These two approaches are known, 
respectively, as in vivo or ex vivo gene therapy. 

30 In a specific embodiment, the nucleic acid is directly administered in vivo, 

where it is expressed to produce the encoded product. This can be 
accomplished by any of numerous methods known in the art, e.g., by 
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constructing it as part of an appropriate nucleic acid expression vector and 
administering it so that it becomes intracellular, e.g., by infection using a 
defective or attenuated retroviral or other viral vector (see U.S. Patent No. 
4,980,286), or by direct injection of naked DNA, or by use of microparticle 
5 bombardment (e.g., a gene gun; Biolistic, Dupont), or coating with lipids or cell- 
surface receptors or transfecting agents, encapsulation in liposomes, 
microparticles, or microcapsules, or by administering it in linkage to a peptide 
which is known to enter the nucleus, by administering it in linkage to a ligand 
subject to receptor-mediated endocytosis (see e.g., Wu and Wu, J. Biol. Chem. 

10 262 :4429-4432 (1987)) (which can be used to target cell types specifically 
expressing the receptors), etc. In another embodiment, a nucleic acid-ligand 
complex can be formed in which the ligand is a fusogenic viral peptide to disrupt 
endosomes, allowing the nucleic acid to avoid lysosomal degradation. In yet 
another embodiment, the nucleic acid can be targeted in vivo for cell specific 

15 uptake and expression, by targeting a specific receptor (see, e.g., PCT 

Publications WO 92/06180 dated April 16, 1992 (Wu et al.); WO 92/22635 
dated December 23, 1992 (Wilson et al.); WO92/20316 dated November 26, 
1992 (Findeis et al.); W093/14188 dated July 22, 1993 (Clarke et al.), WO 
93/20221 dated October 14, 1993 (Young)). Alternatively, the nucleic acid can 

20 be introduced intracellular^ and incorporated within host cell DNA for 

expression, by homologous recombination (Koller and Smithies, Proc. Natl. Acad. 
Sci. USA 86:8932-8935 (1989); Zijlstra et al., Nature 342:435-438 (1989)). 

In a specific embodiment, a viral vector that contains the MTSP protein 
nucleic acid is used. For example, a retroviral vector can be used (see Miller et 

25 al., Meth. EnzymoL 217 :581-599 (1993)). These retroviral vectors have been 
modified to delete retroviral sequences that are not necessary for packaging of 
the viral genome and integration into host cell DNA. The MTSP protein nucleic 
acid to be used in gene therapy is cloned into the vector, which facilitates 
delivery of the gene into a patient. More detail about retroviral vectors can be 

30 found in Boesen et al., Biotherapy 6:291-302 (1 994), which describes the use of 
a retroviral vector to deliver the mdrl gene to hematopoietic stem cells in order 
to make the stem cells more resistant to chemotherapy. Other references 
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illustrating the use of retroviral vectors in gene therapy are: Clowes et al., J. 
Clin. Invest 93:644-651 (1994); Kiem et al.. Blood 83:1467-1473 (1994); 
Salmons and Gunzberg, Human Gene Therapy 4:1 29-141 (1993); and Grossman 
and Wilson, Curr. Opin. in Genetics and DeveL 3:1 10-1 14 (1993). 
5 Adenoviruses are other viral vectors that can be used in gene therapy. 

Adenoviruses are especially attractive vehicles for delivering genes to respiratory 
epithelia. Adenoviruses naturally infect respiratory epithelia where they cause a 
mild disease. Other targets for adenovirus-based delivery systems are liver, the 
central nervous system, endothelial cells, and muscle. Adenoviruses have the 

10 advantage of being capable of infecting non-dividing cells. Kozarsky and Wilson, 
Current Opinion in Genetics and Development 3:499-503 (1993) present a 
review of adenovirus-based gene therapy. Bout et al., Human Gene Therapy 
5:3-10 (1994) demonstrated the use of adenovirus vectors to transfer genes to 
the respiratory epithelia of rhesus monkeys. Other instances of the use of 

15 adenoviruses in gene therapy can be found in Rosenfeld et al., Science 252 :431- 
434 (1991); Rosenfeld et al.. Cell 68:143-1 55 (1992); and Mastrangeli et al., J. 
Clin. Invest. 91:225-234 (1993). 

Adeno-associated virus (AAV) has also been proposed for use in gene 
therapy (Walsh et al., Proc. Soc. Exp. Biol. Med. 204:289-300 (1993). 

20 Another approach to gene therapy involves transferring a gene to cells in 

tissue culture by such methods as electroporation, lipofection, calcium 
phosphate mediated transfection, or viral infection. Usually, the method of 
transfer includes the transfer of a selectable marker to the cells. The cells are 
then placed under selection to isolate those cells that have taken up and are 

25 expressing the transferred gene. Those cells are then delivered to a patient. 

In this embodiment, the nucleic acid is introduced into a cell prior to 
administration in vivo of the resulting recombinant cell. Such introduction can 
be carried out by any method known in the art, including but not limited to 
transfection, electroporation, microinjection, infection with a viral or 

30 bacteriophage vector containing the nucleic acid sequences, cell fusion, 
chromosome-mediated gene transfer, microceli-mediated gene transfer, 
spheroplast fusion, etc. Numerous techniques are known in the art for the 
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introduction of foreign genes into cells (see e.g., Loeffler and Behr, Meth. 
EnzymoL 217:599-618 (1993); Cohen et al., Meth. Enzymol. 217:618-644 
(1993); Cline, Pharmac. Ther. 29:69-92 (1985)) and may be used, provided that 
the necessary developmental and physiological functions of the recipient cells 
5 are not disrupted. The technique should provide for the stable transfer of the 
nucleic acid to the cell, so that the nucleic acid is expressible by the cell and 
preferably heritable and expressible by its cell progeny. 

The resulting recombinant cells can be delivered to a patient by various 
methods known in the art. In a preferred embodiment, epithelial cells are 
10 injected, e.g., subcutaneously. In another embodiment, recombinant skin cells 
may be applied as a skin graft onto the patient. Recombinant blood cells {e.g., 
hematopoietic stem or progenitor cells) are preferably administered 
intravenously. The amount of cells envisioned for use depends on the desired 
effect, patient state, etc., and can be determined by one skilled in the art. 
15 Cells into which a nucleic acid can be introduced for purposes of gene 

therapy encompass any desired, available cell type, and include but are not 
limited to epithelial cells, endothelial cells, keratinocytes, fibroblasts, muscle 
cells, hepatocytes; blood cells such as T lymphocytes, B lymphocytes, 
monocytes, macrophages, neutrophils, eosinophils, megakaryocytes, 
20 granulocytes; various stem or progenitor cells, in particular hematopoietic stem 
or progenitor cells, e.g., as obtained from bone marrow, umbilical cord blood, 
peripheral blood, fetal liver, etc. 

In a preferred embodiment, the cell used for gene therapy is autologous to 
the patient. In an embodiment in which recombinant cells are used in gene 
25 therapy, an MTSP protein nucleic acid is introduced into the cells such that it is 
expressible by the cells or their progeny, and the recombinant cells are then 
administered in vivo for therapeutic effect. In a specific embodiment, stem or 
progenitor cells are used. Any stem and/or progenitor cells which can be 
isolated and maintained in vitro can potentially be used in accordance with this 
30 embodiment. Such stem cells include but are not limited to hematopoietic stem 
cells (HSC), stem cells of epithelial tissues such as the skin and the lining of the 
gut, embryonic heart muscle cells, liver stem cells (PCT Publication WO 
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94/08598, dated April 28, 1994), and neural stem cells (Stemple and Anderson, 
Cell 71:973-985 (1992)). 

Epithelial stem cells (ESCs) or keratinocytes can be obtained from tissues 
such as the skin and the lining of the gut by known procedures (Rheinwald, 
5 Meth. Cell Bio. 21 A :229 (1980)). In stratified epithelial tissue such as the skin, 
renewal occurs by mitosis of stem cells within the germinal layer, the layer 
closest to the basal lamina. Stem cells within the lining of the gut provide for a 
rapid renewal rate of this tissue. ESCs or keratinocytes obtained from the skin 
or lining of the gut of a patient or donor can be grown in tissue culture 

10 (Rheinwald, Meth. Cell Bio. 21A :229 (1980); Pittelkow and Scott, Mayo Clinic 
Proc. 61:771 (1986)). If the ESCs are provided by a donor, a method for 
suppression of host versus graft reactivity {e.g., irradiation, drug or antibody 
administration to promote moderate immunosuppression) can also be used. 

With respect to hematopoietic stem cells (HSC), any technique which 

15 provides for the isolation, propagation, and maintenance in vitro of HSC can be 
used in this embodiment. Techniques by which this may be accomplished 
include (a) the isolation and establishment of HSC cultures from bone marrow 
cells isolated from the future host, or a donor, or (b) the use of previously 
established long-term HSC cultures, which may be allogeneic or xenogeneic. 

20 Non-autologous HSC are used preferably in conjunction with a method of 

suppressing transplantation immune reactions of the future host/patient. In a 
particular embodiment, human bone marrow cells can be obtained from the 
posterior iliac crest by needle aspiration (see, e.g., Kodo et al., J. Clin. Invest. 
73:1377-1384 (1984)). In a preferred embodiment, the HSCs can be made 

25 highly enriched or in substantially pure form. This enrichment can be 

accomplished before, during, or after long-term culturing, and can be done by 
any techniques known in the art. Long-term cultures of bone marrow cells can 
be established and maintained by using, for example, modified Dexter cell culture 
techniques (Dexter et aL, J. Cell Physiol. 91:335 (1977) or Witlock-Witte culture 

30 techniques (Witlock and Witte, Proc. Natl. Acad. Sci. USA 79:3608-3612 
(1982)). 
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ln a specific embodiment, the nucleic acid to be introduced for purposes 
of gene therapy includes an inducible promoter operably linked to the coding 
region, such that expression of the nucleic acid is controllable by controlling the 
presence or absence of the appropriate inducer of transcription. 
5 3. Prodrugs 

A method for treating tumors is provided. The method is practiced by 
administering a prodrug that is specifically cleaved by an MTSP to release an 
active drug. Upon contact with a cell that expresses MTSP activity, the prodrug 
is converted into an active drug. The prodrug can be a conjugate that contains 

10 the active agent, such as an anti-tumor drug, such as a cytotoxic agent, or other 
atherapeutic agent, linked, linked to a substrate for the targeted MTSP, such that 
the drug or agent is inactive or unable to enter a cell, in the conjugate, but is 
activated upon cleavage. The prodrug, for example, can contain an oligopeptide, 
preferably a relatively short, less than about 10 amino acids peptide, that is 

1 5 selectively proteolytically cleaved by the targeted MTSP. Cytotoxic agents, 
include, but are not limited to, alkylating agents, antiproliferative agents and 
tubulin binding agents. Others include, vinca drugs, mitomycins, bleomycins and 
taxanes. 

M. ANIMAL MODELS 

20 Transgenic animal models are provided herein. Such an animal can be 

initially produced by promoting homologous recombination between an MTSP 
protein gene in its chromosome and an exogenous MTSP protein gene that has 
been rendered biologically inactive (preferably by insertion of a heterologous 
sequence, e.g., an antibiotic resistance gene). In a preferred aspect, this 

25 homologous recombination is carried out by transforming embryo-derived stem 
(ES) cells with a vector containing the insertionally inactivated MTSP protein 
gene, such that homologous recombination occurs, followed by injecting the ES 
cells into a blastocyst, and implanting the blastocyst into a foster mother, 
followed by the birth of the chimeric animal ("knockout animal") in which an 

30 MTSP protein gene has been inactivated (see Capecchi, Science 244:1288-1292 
(1989)). The chimeric animal can be bred to produce additional knockout 
animals. Such animals can be mice, hamsters, sheep, pigs, cattle, etc., and are 
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preferably non-human mammals. In a specific embodiment, a knockout mouse is 
produced. 

Such knockout animals are expected to develop or be predisposed to 
developing neoplastic diseases and thus can have use as animal models of such 
5 diseases e.g., to screen for or test molecules for the ability to treat or prevent 
such diseases or disorders. Hence, the animal models for are provided. Such 
an animal can be initially produced by promoting homologous recombination 
between an MTSP gene in its chromosome and an exogenous MTSP protein 
gene that would be over-expressed or mis-expressed (preferably by expression 

10 under a strong promoter). In a preferred aspect, this homologous recombination 
is carried out by transforming embryo-derived stem (ES) cells with a vector 
containing the over-expressed or mis-expressed MTSP protein gene, such that 
homologous recombination occurs, followed by injecting the ES cells into a 
blastocyst, and implanting the blastocyst into a foster mother, followed by the 

15 birth of the chimeric animal in which an MTSP gene has been over-expressed or 
mis-expressed (see Capecchi, Science 244:1 288-1 292 (1989)). The chimeric 
animal can be bred to produce additional animals with over-expressed or mis- 
expressed MTSP protein. Such animals can be mice, hamsters, sheep, pigs, 
cattle, etc., and are preferably non-human mammals, in a specific embodiment, 

20 a mouse with over-expressed or mis-expressed MTSP protein is produced. 



The following examples are included for illustrative purposes only and are 
not intended to limit the scope of the invention. 

EXAMPLE 1 

Cloning of MTSP3, cloning and mutagenesis of the Protease domain of MTSP3 
1. Identification and cloning of MTSP3 

a. Identification of EST clones AI924527 and AI924182 as 
part of a serine protease MTSP3 

DNA encoding the protease domain of the protease designated MTSP1 
was independently cloned from the human prostatic adenocarcinoma cell line, 
PC-3, using degenerate oligonucleotide primers, then sequenced and 
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characterized (see EXAMPLE 6). The sequence of the sense degenerate primer 
used in cloning MTSP1 was 5'-TGGRT(l)VT{l)WS(l)GC(l»RC{l)CAYTG-3' (SEQ ID 
No: 13), and that of the anti-sense was 

5'-(l)GG(l)CC(l)CC(l)SWRTC(l)CCYT(l)RCA(l)GHRTC-3' {SEQ ID No:14), where 
5 R = A,G; V = G,A,C; W = A,T; S = G,C; Y = C,T; H = A,T,C. The primer sequences 
correspond to two highly conserved regions in all serine proteases and should 
amplify PCR products ranging from 400 to 500 base pairs. MTSP1 was 
subsequently found to be identical to matriptase (Genbank accession number 
AF1 18224; see also Takeuchi et al. f Proc. Natl. Acad. Set. USA, 
10 961201:11054-61 (1999); and Lin et al., J. Biol. Chem.. 274(26): 1 8231 -6 
1999). 

Using the protein sequence of the protease domain of the serine protease 
MTSP1 , the EST database (dbEST) at the National Center for Biotechnology 
Information (Bethesda, MD; www.ncbi.nlm.nih.gov) was searched for EST 

15 clones that contain similar or identical sequences to MTSP1 using the search 
algorithm tblastn. The tblastn algorithm compares a protein query sequence 
against a nucleotide sequence database dynamically translated in all six reading 
frames (both strands). The sequences for two identical EST clones 
(NCI_CGAP_Lu 1 9 AI924527 and AI924182) derived from human lung tumor 

20 tissue showed 43% identity with the MTSP.1 protein sequence. Subsequent 

search of GenBank and SwissProt database for the EST sequence AI924527 and 
AI924182 did not show any matching sequence to MTSP1, indicating that the 
sequence contained in these EST clones AI924527 and AI924182 may be 

portions of a new serine protease. 
25 b. PCR cloning of a cDNA fragment of another membrane type 

serine protease MTSP3 

The double-stranded Marathon-Ready(tm) cDNA library derived from 
human lung carcinoma (LX-1 ) was obtained from Clontech (Palo Alto, CA; 
30 catalog # 7495-1) and used as a template. Two primers, 

5'-TCACCGAGAAGATGATGTGTGCAGGCATCC-3' (SEQ ID No:15) (sense 
primer), and 5'-GGGACAGGGGCTGTAAGGCAGGGAATGAG-3' (SEQ ID No:16) 
(antisense primer), were used to amplify a -360 bp DNA fragment. The PCR 
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product was separated on a 2% agarose gel and purified using a gel extraction 
kit (catalog number 28706; QIAquick gel extraction kit; Giagen). The purified 
DNA fragment was ligated into TA vectors (catalog number K4500-01; 
TOPO-TA cloning kit, Invitrogen, Carlsbad, CA). After transformation into E. coli 
5 cells, plasmids were isolated and analyzed by digestion with EcoRI restriction 
enzyme. Clones that had inserted DNA were further characterized by 
sequencing using a fluorescent dye-based DNA sequencing method (catalog 
number 4303149; BigDye terminator cycle sequencing kit with AmpliTaq DNA 
polymerase; Perkin Elmer, Lincoln, CA). . 
10 The DNA sequence obtained was analyzed and has 43% identity with the 

MTSP1 protein sequence. This indicates that the LX-1 cDNA library contains a 
desired nucleic acid molecule. It was used to isolate a cDNA clone 
encompassing a full length protease. 

c. 5'- and 3'- rapid amplification of cDNA ends (RACE) 
15 To obtain the full-length cDNA that encoded this serine protease, 

hereafter called MTSP3, 5'- and 3'-RACE reactions were performed. The 
Marathon-Ready cDNA library from human lung carcinoma (LX-1) was used to 
isolate the 5' and 3' ends of the cDNA encoding MTSP3. Marathon-Ready 
cDNA is specifically made for RACE reactions. Two gene specific primers were 

20 used: 5'-CCCGCAGCCATAGCCCCAGCTAACG-3' (SEQ ID No. 17) for 5'-RACE 
reaction and 5'-GCAGACGATGCGTACCAGGGGGAAGTC-3' (SEQ ID No. 18) for 
3'-RACE reaction. Two fragments, approximately 1 .8 kbp and 0.85 kbp, were 
isolated that correspond to the missing 5' and 3' end sequences, respectively. 
These fragments were subcloned as described above. They were further 

25 confirmed by Southern analysis using an internal cDNA fragment encompassing 
the 2 primers used in the RACE reactions as probe and by DNA sequence 
analysis. 

d. PCR amplification of cDNA encoding full-length protease 
domain of MTSP3 

30 

To obtain the cDNA fragment encoding the protease domain of MTSP3, 
an end-to-end PCR amplification using gene-specific primers was used. The two 
primers used were: 
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B'-CTCGAGAAAAG AGTGGTGGGTGGGGAGGAGGCCTCTGTG -3' (SEQ ID No. 
19) for the 5' end and 5'-GCGGCCGCAHACAGCTCAGCCTTCCAGAC-3' (SEQ 
ID No. 20) for the 3' end. The 5' primer contains the sequence (underlined) 
that encodes the start of the MTSP3 protease domain (VVGGEEASV). The 3' 
5 primer contains the stop codon (underlined) of MTSP3. A ~700-bp fragment 
was amplified and subcloned into a Pichia pastoris expression vector, pPIC9K. 
e. C310S mutagenesis of MTSP3 

To eliminate the free cysteine (at position 310 in SEQ ID No. 4) that 
exists when the protease domain of the MTSP3 protein is expressed or the 

10 zymogen is activated, the free cysteine at position 310 (see SEQ ID No. 3), 
which is Cys1 22 if a chymotrypsin numbering scheme is used, was replaced 
with a serine. The resulting vector was designated pPIC9K:MTSP3C1 22S. 

The gene encoding the protease domain of MTSP3 was mutagenized by 
PCR SOE (PCR-based splicing by overlap extension) to replace the unpaired 

15 cysteine at position 310 (122 chymotrypsin numbering system) with a serine. 
Two overlapping gene fragments, each containing the TCT codon for serine at 
position 310 were PCR amplified using the following primers: for the 5' gene 
fragment, TCTCTCGAGAAAAGAGTGGTGGGTGGGTGGGGAGGAGGCCTCTGTG 

SEQ ID No. 51 and 

20 GCTCCTCATCAAAGAAGGGCAGAGAGATGGGCCTGACTGTGCC SEQ ID No. 
52; for the 3' gene fragment, 

ATTCGCGGCCGCATTACAGCTCAGCCTTCCAGAC (SEQ ID No. 53) and 

GGCACAGTCAGGCCCATCTCTCTGCCCTTCTTTGATGAGGAGC (SEQ ID No. 

54). The amplified gene fragments were purified on a 1 % agarose gel, mixed 
25 and reamplified by PCR to produce the full length coding sequence for MTSP3 

C122S. This sequence was then cut with restriction enzymes Notl and Xhol, 

and ligated into vector pPic9K. 

2. Sequence analysis 

All derived DNA and protein sequences were analyzed using MacVector 
30 (version 6.5; Oxford Molecular Ltd., Madison, Wl). The full-length cDNA 

encoding MTSP3 is composed of 2,137 base pairs containing the longest open 

reading frame of 1,314 base pairs which translate to a 437-amino acid protein 
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sequence. The cDNA fragment (nt 873-1 ,574) encoding the protease domain of 
MTSP3 is composed of 702 base pairs which translate to a 233-amino acid 
protein sequence plus the stop codon. The DNA sequence and the translated 
protein sequence of MTSP3 are shown in SEQ ID Nos. 3 and 4, respectively. 
5 3. Construction of the Expression Vectors 

DNA encoding MTSP3 full length protein containing the C310S point 
mutation {i.e., MTSP3C1 22S) was cloned from pPIC9K:MTSP3C1 22S. The 
primers MTSP3: 

5' GAATTCCATATGCCGCGCTTTAAAGTGGTGGGTGGGGAGGAGGCC SEQ ID 

10 No. 47 (containing a Ndel restriction site) and MTSP3- 

3' GGGATACCCGTTACAGCTCAGCCTTCCAGAC 5' SEQ ID No. 48 (containing 
a BamHI restriction site) were used to PCR amplify the human MTSP3C1 22S 
protease domain utilizing a plasmin recognition sequence (PRFK) for zymogen 
activation. Amplification was conducted in a total volume of 50 /yl containing 10 

15 mM KCI, 20 mM Tris-HCI (pH 8.8 at 25 °C), 10 mM (NH 4 ) 2 S0 4 , 2.0 mM 

MgS0 4 , 0.1 % Triton X-100, 0.3 mM dNTPs, 5.0 units of vent DNA polymerase, 
and 100 pmol of primers. The reaction mixtures were heated to 95 °C for 5 
min, followed by 25-30 cycles of 95, 60, and 75 °C for 30 s each and a final 
extension at 75 °C for 2 min. 

20 PCR products were purified using a QIAquick PCR purification kit 

(QIAGEN Inc., Chatsworth, CA). Full-length oligonucleotides were doubly 
digested with 10 units BamHI and 20 units Ndel for 2 h at 37 °C. The digested 
fragments were purified on a 1 .3% agarose gel and stained with ethidium 
bromide. The band containing the MTSP3C1 22S encoding DNA was excised and 

25 purified using a QIAEX II gel extraction kit. 

The MTSP3C1 22S encoding DNA was then cloned into the Ndel and 
BamHI sites of the pET19b vector (Novagen) using standard methods. This 
vector allows the fusion of a HIS 6 tag for purification by metal affinity 
chromatography (MAC). Competent XL1 Blue cells (Stratagene) were 

30 transformed with the pET1 9b-MTSP3C1 22S vector and used to produce 
plasmid stocks. Proper insertion and DNA sequence were confirmed by 
fluorescent thermal dye DNA sequencing methods as well as restriction digests. 
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4. Protein Expr ssion. Purification, and Refolding 

Overexpression of the gene product was achieved in E. coli strain 
BL2KDE3) (Novagen, Madison Wl) containing the DNAY plasmid for rare codon 
optimization (see, e.g., Garcia etaL (1986) Ce//45.\453-459). Cells were grown 
5 at 37 °C in (2xYT) media supplemented with carbenicillin and kanamycin to a 
final concentrations of 50 ug/ml and 34 ug/ml, respectively. One liter cultures 
were inoculated with 10 mL of an overnight culture grown in the same media. 
Cells were allowed to grow to a density of 0.6 - 1 .0 OD 600 before the addition 
of IPTG (final concentration 1 .0 mM). Cells were grown an additional 4 hours 

10 before harvesting. 

The cell pellet was resuspended in 20 mL of lysis buffer (50 mM 
Na 2 HP0 4 , 300 mM NaCI, pH 7.4). The cell suspension was treated with 10-20 
mg lysozyme and incubated at 37 °C for 1 hour. DNasel was then added (1- 
2mg) with mixing until the solution was no longer viscous. The solution was 

15 then transferred to a Rosette flask and sonicated, on ice, at high power for 1 5 
min. Inclusion bodies were pelleted by centrifugation at 20K rpm (-48,000 g) 

at 4°C for 30 min. 

Inclusion bodies were washed by douncing 2 times in 50 mM Na 2 HP0 4 , 
300 mM NaCI, 5% LADO, pH 7.4 followed by 2 times in 50 mM Na 2 HP0 4 , 300 

20 mM NaCI, pH 7.4. Inclusion bodies (~ 500 mg) are solubilized in 25 mL 6 M 
GuHCI, 1 00 mM tris-HCI, 20 mM £Me, pH 8.0. This solution was spun at 20K 
rpm for 30 minutes to pul( down any particulate matter. This solution was 
passed through a 0.2 jt/M filter and diluted to 100 mL in solubilization buffer. 

MTSP3C1 22S was refolded by slowly adding the inclusion body mixture 

25 to 8 L of refolding buffer (100 mM tris-HCI, 150 mM NaCI, 5 mM GSH, 0.05 
mM GSSG, 1 M arginine, pH 8.0) using a peristaltic pump. The refolding 
mixture was allowed to stir at 4°C for 7 days or until the thiol concentration 
was below 1 mM as detected by Ellman's reagent. The solution was filtered 
through a 5 //M filter, concentrated by ultrafiltration and the buffer exchanged 

30 into MAC equilibration buffer (50 mM Na 2 HP0 4 , 300 mM NaCI, 10 mM 

imidazole, pH 8.0) by crossflow filtration. The resulting solution was passed 
through a 0.2 //M filter and further purified on a FPLC (Amersham-Pharmacia) 
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using Pharmacia chelating sepharose. The solution was loaded onto the nickel 

loaded MAC at a flow rate of 1 .0 mL/min and eluted with a linear gradient of 1 .0 

mM -1 .0 M imidazole in 50 mM Na 2 HP0 4 , 300 mM NaCI, pH 8.0. Protein 

containing fractions were determined by SDS-PAGE and subsequently pooled 

5 and frozen at -80°C. 

Small amounts of purified MTSP3C1 22S were activated using piasmin 

sepharose for 30 min. at 37 °C. The resin was spun down at 14K rpm for 5 

min. and the protein solution removed. The resulting solution was screened for 

activity against of series of protease substrates; spec-tpa, spec-pl, spec-UK, 

10 spec-fXIla (American Diagnostica), S-2238, S-2266 (Kabi Diagnostica), S-2586, 

S-2366, S-2444, S-2288, S-2251, S-2302, S-2765, S-2222, spec-THE 

(Chromogenix), spec-fVlla (Pentapharm). MTSP3C1 22S cleaved several of these 

substrates efficiently but was most active towards Spec-fXIla, Spec-tPA, S- 

2765, Spec-fVlla and S-2444. 

15 5. Gene expression profile of the serine protease MTSP3 in normal and 

tumor tissues 

To obtain information regarding the tissue distribution of the MTSP3 
transcripts, the DNA insert encoding the MTSP3 protease domain was used to 

20 probe a RNA blot composed of 76 different human tissues (catalog number 

7775-1 ; human multiple tissue expression (MTE) array; CLONTECH, Palo Alto, 
CA). The expression pattern observed in decreasing signal level was: trachea = 
colon (descending) = esophagus > colon (ascending) > colon (transverse) = 
rectum > ileum > duodenum > jejunum > bladder > ilocecum > stomach > 

25 kidney > appendix. It is also expressed less abundantly in fetal kidney, and in 
two tumor cell lines, HeLa S3 and leukemia, K-562. Northern analysis using 
RNA blots (catalog numbers 7780-1, 7765-1 & 7782-1; human 12-lane, human 
muscle and human digestive system multiple tissue northern (MTN) blots; 
CLONTECH) confirmed that the expression was detected most abundantly in the 

30 colon, moderately in the esophagus, small intestine, bladder and kidney, and less 
abundantly in stomach and rectum, A single transcript of —2.2 kb was 
detected. 
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Amplification of the MTSP3 transcript in several human primary tumors 
xenografted in mouse was performed using gene-specific primers. The MTSP3 
transcript was detected in lung carcinoma (LX-1), colon adenocarcinoma (CX-1), 
colon adenocarcinoma (GI-1 12) and ovarian carcinoma (GI-102). No apparent 
5 signal was detected in another form of lung carcinoma (GI-1 17), breast 
carcinoma (GI-1 01), pancreatic adenocarcinoma (GI-1 03) and prostatic 
adenocarcinoma (PC3). 

EXAMPLE 2 
Identification of genomic clone of MTSP4 

10 Using the nucleotide sequence encoding the protease domain of the 

serine protease MTSP1 (also called matriptase), the protein database 
(SWISSPROT) at the National Center for Biotechnology Information (Bethesda, 
MD; <http://www.ncbi.nlm.nih.gov>) was searched for similar or identical 
sequence to MTSP1 using the search algorithm blastx. The blastx algorithm 

15 compares the six-frame conceptual translation products of a nucleotide query 
sequence (both strands) against a protein sequence database. A protein 
encoding sequence (CAA18442) that has 37% identity to the MTSP1 protein 
sequence that was found to include a putative LDL-receptor domain and a 
trypsin-like serine protease domain was identified. This protein-encoding 

20 sequence (hereinafter referred to as MTSP4) was found to be encoded by a 

genomic clone (AL022314) derived from human chromosome 22 sequenced by 
the Sanger Centre Chromosome 22 Mapping Group and deposited into the public 
database as part of the Human Genome Project. Subsequent search of the 
GenBank database showed that no identical sequence has been deposited. A 

25 search of the EST database also did not show any matching human sequence, 
indicating that no human EST clone exists in the public database. Mouse EST 
clones (AI391417 and AA208793) are present and showed 88% identity to the 
serine protease at the nucleotide level. 

PCR cloning of a genomic DNA fragment of MTSP4 for use as hybridization 
30 probe 

In order to obtain tissue distribution profile of MTSP4 as well as to 
identify a tissue source for subsequent cloning of the cDNA, a genomic fragment 
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was amplified from human genomic DNA using two gene-specific primers, 
5'-CCTCCACGGTGCTGTGGACCGTGTTCC-3' (5' primer) SEQ ID No. 21 and 
5'-CCTCGCGCAAGGCGCCCCAGCCCG-3' (3' primer) SEQ ID No. 22. These 
two primers amplified a 265-base pair fragment within a single exon of MTSP4. 
5 The fragment was then used as a hybridization probe on human tissue northern 
blot (human 12-lane multiple tissue northern (MTN) blot (catalog number 
7780-1); CLONTECH, Palo Alto, CA). A prominent band (-2.6 kb) was 
detected in liver. Relatively weaker signals were obtained from the brain, heart, 
skeletal muscle and kidney. Since human liver showed a very strong signal, this 

10 tissue was selected for the amplification of the MTSP4 cDNA. 
5'- and 3'- rapid amplification of cDNA ends (RACE) 

To obtain a full-length clone encoding MTSP4, 5'- and 3'-RACE reactions 
were performed. The Marathon-Ready cDNA library from human liver 
(CLONTECH) was used to isolate the 5' and 3' ends of the cDNA encoding 

15 MTSP4. Marathon-Ready cDNA clones are specifically made for RACE reactions. 
Two gene specific primers were used: 

5'-GCGTGGCGTCACCTGGTAGCGATAGACCTCGC -3' (SEQ ID No. 23) for 
5'-RACE reaction and 5'-CCTCCACGGTGCTGTGGACCGTGTTCC-3' (SEQ ID No. 
24) for 3'-RACE reaction. No fragment was obtained from the initial 5'-RACE 
20 reaction. 

The 3'-RACE reaction, however, produced a -1.5 kbp fragment. A 
nested PCR reaction was used on the initial 5'-RACE reaction products to obtain 
part of the 5' end of MTSP4. The nested 5' gene-specific primer used was 
5'-CCTCGCGCAAGGCGCCCCAGCCCG-3' (SEQ ID No. 25) and produced a 

25 —0.8 kbp fragment. The fragments were subcloned into pCR2.1-TOPO TA 

cloning vector (Invitrogen, Carlsbad, CA). The resulting clones were analyzed by 
Southern analysis using the internal genomic fragment encompassing the primers 
used in the RACE reactions as probe and by DNA sequence analysis. Sequence 
analysis of the 5'-RACE product showed that the potential initiation codon was 

30 still missing. 

To obtain the 5' cDNA end that encodes the N terminus of MTSP4, the 
publicly available genomic sequence of chromosome 22 was searched for 
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sequence corresponding to the sequence obtained in the 5'-RACE clone. The 
resulting genomic sequence was translated and the protein sequence was 
compared to that derived from the translated sequence of the 5'-RACE clone. 
After determining the overlapping sequences, a gene-specific oligonucleotide 
5 primer (5'-TCATCGGCCAGAGGGTGATCAGTGAG-3') SEQ ID No. 26 

corresponding to the sequence upstream of the potential initiation codon and 
another gene-specific oligonucleotide primer 

(5'-CCTCCTCAGTGCATAGGCATCAAACCAG-3') SEQ ID No. 27 corresponding 
to a sequence within the overlapping region were used to amplify the missing 5' 
10 cDNA of MTSP4 from the human liver cDNA library. 
Splice variants and domain organization of MTSP4 

At least two cDNA fragments were consistently obtained during PCR 
amplification, indicating multiple splice variants of MTSP4. Subcloning and 
sequence analysis revealed that a longer, more abundant form, MTSP4-L and a 
15 shorter form, MTSP4-S. The encoded proteins are multi-domain, type II 

membrane-type serine proteases and include a transmembrane domain at the N 
terminus followed by a CUB domain, 3 LDLR domains and a trypsin-like serine 
protease domain at the C terminus. The difference between these two forms of 
MTSP4 is the absence in MTSP4-S of a 432-bp nucleotide sequence between 
20 the transmembrane and the CUB domains (see FIGURE 2). 

PCR amplification of cDNA encoding full-length protease domain of MTSP4 

To obtain a cDNA fragment encoding the protease domain of MTSP4, an 
end-to-end PCR amplification using gene-specific primers and the 
Marathon-Ready cDNA library from human liver was used. The two primers 
25 used were: B'-TCTCTCGAGAAAAGAATTGTTGGTGGAGCTGTGTCCTCCGAG 
-3' (SEQ ID No. 28 ) for the 5' end and 

5'-AGGTGGGCCTTGCTTTGCAGGGGGGCAGTTC-3' for the 3' end SEQ ID NO. 
29). The 5' primer contained the sequence that encodes the start of the MTSP4 
protease domain (IVGGAVSSE). The 3' primer corresponds to the sequence just 
30 downstream of the stop codon. A ~740-bp fragment was amplified, subcloned 
into pCR2.1-TOPO TA cloning vector and sequenced. 
Gene expression profile of MTSP4 in normal and tumor tissu s 
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To obtain information regarding the gene expression profile of the MTSP4 
transcript, a DNA fragment encoding part of the LDL receptor domain and the 
protease domain was used to probe an RNA blot composed of 76 different 
human tissues (catalog number 7775-1; human multiple tissue expression (MTE) 
5 array; CLONTECH). As in the northern analysis of gel blot, a very strong signal 
was observed in the liver. Signals in other tissues were observed in (decreasing 
signal level): fetal liver > heart = kidney = adrenal gland = testis = fetal heart 
and kidney = skeletal muscle = bladder = placenta > brain = spinal cord = 
colon = stomach = spleen = lymph node = bone marrow = trachea - uterus 

10 = pancreas = salivary gland — mammary gland = lung. MTSP4 is also 
expressed less abundantly in several tumor cell lines including HeLa S3 = 
leukemia K-562 = Burkitt's lymphomas (Raji and Daudi) = colorectal 
adenocarcinoma (SW480) > lung carcinoma (A549) = leukemia MOLT-4 = 
leukemia HL-60. PCR of the MTSP4 transcript from cDNA libraries made from 

15 several human primary tumors xenografted in nude mice (human tumor multiple 
tissue cDNA panel, catalog number K1 522-1, CLONTECH) was performed using 
MTSP4-specific primers. The MTSP4 transcript was detected in breast 
carcinoma (GI-101), lung carcinoma (LX-1), colon adenocarcinoma (GI-112) and 
pancreatic adenocarcinoma (GI-103). No apparent signal was detected in 

20 another form of lung carcinoma (GI-117), colon adenocarcinoma (CX-1), ovarian 
carcinoma (GI-102). and prostatic adenocarcinoma (PC3). The MTSP4 
transcript was also detected in LNCaP and PC-3 prostate cancer cell lines as well 
as in HT-1080 human fibrosarcoma cell line. 
Sequence analysis 

25 MTSP4 DNA and protein sequences were analyzed using MacVector 

(version 6.5; Oxford Molecular Ltd., Madison, Wl). The ORF of MTSP4-L 
includes 2,409 bp, which translate to a 802-amino acid protein, while. the ORF 
of MTSP4-S is composed of 1 ,977 bp which translate to a 658-amino acid 
protein. The cDNA encoding the protease domain in both forms is composed of 

30 708 bp which translate to a 235-amino acid protein sequence (see, SEQ ID No. 
6) The DNA sequences and the translated protein sequences of MTSP4-L and 



WO 01/57194 



PCTAJS01/03471 



-154- 

MTSP4-S, and of the protease domain of MTSP4 are set forth in SEQ ID Nos. 8, 
10 and 6, respectively. 

EXAMPLE 3 

Cloning of MTSP6 
5 Identification of genomic clone of MTSP6 

Using the protein sequence of the protease domain of the serine protease 
MTSP4 (see EXAMPLE 2), the non-redundant database (alt non-redundant 
GenBank CDS translations + PDB + SwissProt + PIR + PRF) at the National 
Center for Biotechnology Information (Bethesda, MD; 

10 <http://www.ncbi.nlm.nih.gov>) was searched for sequences that were similar 
or identical to MTSP4 using the search algorithm tblastn. The tblastn algorithm 
compares a protein query sequence against a nucleotide sequence database 
dynamically translated in all reading frames. A protein (55 amino acids), which 
has 60% identity with the query MTSP4 sequence (55 amino acids), was 

15 obtained from the translation of genomic sequence of AC01 5555 (nucleotide 
#15553 to 15717). This protein hereafter is referred to as MTSP6. 
Subsequent search of the GenBank database showed that no cDNA encoding 
MTSP6 has been deposited. 

The gene exhibiting highest homology to MTSP6 was human 

20 transmembrane serine protease 2 (GenBank accession number U75329; 

Swissprot accession number 015393), which showed 66% identity to MTSP6 
within the 45 amino acid regions compared. Consequently, the nucleotide 
sequence encoding the MTSP6 protease domain was obtained by comparing the 
protein sequence of human transmembrane serine protease 2 protease domain 

25 with the nucleotide sequence of AC015555 translated in six reading frames. 
The protein sequence obtained from the translated nucleotide sequence of 
MTSP6 revealed an overall 50% identity with human transmembrane serine 
protease 2. A search of the EST database indicated the presence of seven 
MTSP6 EST clones (AA883068, AW591433, AI978874, AI469095, AI935487, 

30 AA534591 and AI758271). 
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Cloning of human MTSP6 full-length cDNA 

To obtain cDNA encoding the region of the MTSP6 protease domain 
identified by database searches described above, two gene-specific primers, 
CM7-NSP-1, 5'-TCACGCATCGTGGGTGGAACATGTCC-3' (5' primer) SEQ ID 
5 NO. 30 nd Chi 7-NSP-2AS, 5'- ACCCACCTCCATCTGCTCGTGGATCC-3' SEQ 
ID NO. 31 (3' primer), were used for PCR. These two primers amplified a 
708-base pair fragment from human mammary gland carcinoma cDNA (Clontech 
Marathon-Ready cDNA, Cat. No. 7493-1). 

To obtain the remaining, unknown cDNA of MTSP6, 5'- and 3'-RACE 
10 reactions were performed on the human mammary gland carcinoma. 

Marathon-Ready cDNA is specifically made for RACE reactions. The first RACE 
reactions were performed by PCR using Marathon cDNA adaptor primer 1 (AP1) 
with gene specific primers, Ch1 7-NSP-2AS, 5'- 

ACCCACCTCCATCTGCTCGTGGATCC-3' SEQ ID NO. 31 for 5'-RACE reaction 
15 and CM7-NSP-1, 5'-TCACGCATCGTGGGTGGAACATGTCC-3' SEQ ID NO. 30 
for 3'-RACE reaction. The PCR products were purified from agarose gel. A 
second nested PCR was then performed using Marathon cDNA adaptor primer 2 
(AP2) with gene specific primer Ch1 7-NSP-3AS, 

5'- CCACAGCCTCCTCTCTTGACACACCAG-3' SEQ ID No. 32 for 5'-RACE 
20 reaction (using first 5'-RACE product as template) and CM7-NSP-3 

5'-ACGCCCCTGTGGATCATCACTGCTGC-3' SEQ ID No. 33 for 3'-RACE 
reaction (using first 3'-RACE product as template). First 5'- and 3'-RACE 
products were also used as template for PCR reactions using primers 
CM7-NSP-3 and CM7-NSP-4AS to obtain a cDNA fragment for use as a probe. 
25 PCR products from RACE reactions which were larger than 700 bp were cut out 
and purified from agarose gel and subcloned into pCR2.1-TOPO cloning vector 
(Invitrogen, Carlsbad, CA). Colony hybridization was then performed to identify 
positive colonies containing MTSP6 sequence. Positive clones were identified 
by colony hybridization using the 495 bp DNA fragment obtained from PCR 
30 reaction (with primers CM7-NSP-3 and CM7-NSP-4AS) and by DNA 
sequencing. 
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Sequence analysis of the B'-RACE products indicated that an additional 
420 bp of upstream sequence were obtained. The potential initial codon was 
not present in the 5 '-RACE sequence. Another round of nested 5'-RACE 
reaction was performed using AP2 and a gene specific primer {designed based 
5 on the new RACE sequence) Ch17-NSP-5AS 

5'-TCCCTCCCTCACATATACTGAGTGGTG-3' SEQ ID No. 34, using the PCR 
products obtained from the first B'-RACE as template. A PCR product of 367 bp 
using CM7-NSP-6 5'-CGACTGCTCAGGGAAGTCAGATGTCG-3' SEQ ID NO. 35 
(designed based on the new B'-RACE sequence) and Ch17-NSP-5AS was used 

10 to identify the positive clones. An additional sequence of 480 bp was obtained 
from the second 5'-RACE products. A potential ATG start codon was observed 
within a sequence of GTCACCATGG (nucleotides 262-272 of SEQ ID No. 1 2, 
which appears to be a Kozak sequence (GCC (A/G) CCAUGG), indicating that 
this ATG is likely the initiation codon for MTSP6. 

15 The 3'-RACE reaction to obtain the rest of the 3' end of MTSP6 was not 

successful using Marathon Ready human mammary gland carcinoma cDNA. The 
sequence of the 3'-RACE products obtained was exclusively that of an MTSP6 
cDNA truncated with the Marathon AP2 primer sequence within the coding 
region. 

20 The 3'-end sequence of MTSP6 was obtained by PCR using Ch17-NSP-3 

(5'-ACGCCCCTGTGGATCATCACTGCTGC-3'; SEQ ID NO. 33) and CM7-NSP-4 
(5'-CTGGTGTGTCAAGAGAGGAGGCTGTGG-3'; SEQ ID NO. 37) with an 
antisense primer Ch1 7-NSP-7AS 

(5'-ACTCAGGTGGCTACTTATCCCCTTCCTC-3'; SEQ ID NO. 38) designed based 
25 on the sequence of an EST clone AA883068, which apparently covers the 
3'-end of MTSP6 sequence, and human small intestine cDNA (Clontech) as 
template. Two PCR products (650 bp and 1 82 bp, respectively) were obtained 
and DNA sequence analysis indicated that both PCR products contained a stop 
codon. 

30 Sequence analysis and domain organization of MTSP6 

The MTSP6 DNA and protein sequences were analyzed using DNA Strider 
(version 1.2). The ORF of MTSP6 is composed of 1,362 bp, which translate 



WO 01/57194 



PCT/US01/03471 



-157- 

into a 453-amino acid protein. Protein sequence analysis using the SMART 
(Simple Modular Architecture Research Tool) program at 

http://smart.embl-heidelberg.de predicts that MTSP6 is a multi-domain, type-ll 
membrane-type serine protease containing of a transmembrane domain (amino 
5 acids 48-68) at the N terminus followed by a LDLRa domain (LDL receptor 

domain class a) (amino acids 72-108), a SR domain {Scavenger receptor Cys-rich 
domain)(amino acids 109-205), and a trypsin-like serine protease domain (amino 
acids 216-443) (see FIGURE 3). 

Gene expression profile of MTSP6 in normal and tumor tissues 

10 To obtain information regarding the gene expression profile of the MTSP6 

transcript, a 495 bp DNA fragment obtained from PCR reaction with primers 
CM7-NSP-3 and NSP-4AS was used to probe an RNA blot composed of 76 
different human tissues (catalog number 7775-1; human multiple tissue 
expression (MTE) array; CLONTECH). The strongest signal was observed in 

15 duodenum. Signal in other tissues were observed in (decreased signal level): 
Stomach > trachea = mammary gland = thyroid gland = salivary gland = 
pituitary gland = pancreas > kidney > lung > jejunum = ileum = ilocecum = 
appendix = fetal kidney > fetal lung. Very weak signals can also be detected 
in several other tissues. MTSP6 is also expressed in several tumor cell lines 

20 including HeLa S3 > colorectal adenocarcinoma (SW480) > leukemia MOLT-4 
> leukemia K-562. PCR analysis of the MTSP6 transcript from cDNA libraries 
made from several human primary tumors xenografted in nude mice (human 
tumor multiple tissue cDNA panel, catalog number K1 522-1, CLONTECH) was 
performed using MTSP6-specific primers (CM7-NSP-3 and Ch1 7-NSP2AS). The 

25 MTSP6 transcript was strongly detected in lung carcinoma (LX-1 ), moderately 
detected in pancreatic adenocarcinoma (GI-103), weakly detected in ovarian 
carcinoma (GI-102); and very weakly detected in colon adenocarcinoma (GI-1 12 
and CX-1), breast carcinoma (GI-1 01), lung carcinoma (GI-1 17) and prostatic 
adenocarcinoma (PC3). The MTSP6 transcript was also detected in breast 

30 cancer cell line MDA-MB-231 , prostate cancer cell line PC-3, but not in HT-1080 
human fibrosarcoma cell line. MTSP6 is also expressed in mammary gland 
carcinoma cDNA (Clontech). 
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EXAMPLE 4 
Expression of the protease MTSP domains 

The DNA encoding each of the MTSP 3 and 4 protease domains was 
cloned into a derivative of the Pichia pastoris vector pPIC9K (available from 
5 Invitrogen; see SEQ ID NO. 45). Plasmid pPIC9k features include the 5' AOX1 
promoter fragment at 1-948; 5' AOX1 primer site at 855-875; alpha-factor 
secretion signal(s) at 949-1 218; alpha-factor primer site at 1 1 52-1 172; multiple 
cloning site at 1 192-1241; 3' AOX1 primer site at 1327-1347; 3' AOX1 
transcription termination region at 1253-1586; HIS4 ORF at 45 14-1 980; 
10 kanamycin resistance gene at 5743-4928; 3' AOX1 fragment at 6122-6879; 
ColE1 origin at 7961-7288; and the ampicillin resistance gene at 8966-8106. 
The plasmid used herein is derived from pPIC9K by eliminating the Xhol site in 
the kanamycin resistance gene and the resulting vector is herein designated 
pPIC9KX. 

1 5 Primers used for PCR amplification of protease domain and subcloning 

into the XhoI/NotI sites of Pichia vector 

MTSP3 

5' primer (with Xhol site [underlined]) SEQ ID No. 39 
5' TC TCTCGAG AAAAGAGTGGTGGGTGGGGAGGAGGCCTCTGTG 3' 
20 3' primer (with Notl site [underlined]) SEQ ID No. 40 

5' ATTC GCGGCCGC ATTACAGCTCAGCCTTCCAGAC 3' 

MTSP4-S and MTSP4-L 
5' primer (with Xhol site [underlined]) SEQ ID No. 41 

5' TCT CTCGAG AAAAGAATTGTTGGTGGAGCTGTGTCCTCCGAG 

25 3' primer with Notl site SEQ ID No. 42 

5' ATTC GCGGCCGCT CAGGTCACCACTTGCTGGATCCAG 3' 
MTSP6 

MTSP6 was cloned into the E. coli TOPO vector (pcR® 2.1 TOPO™, SEQ 
ID No. 46, Invitrogen, Carlsbad, CA; the TOPO® TA Cloning® Kit is designed 
30 form cloning Taq-amplified PRCR products). 

5' primer (with Xhol site [underlined]) SEQ ID No. 43 

5' CTCGAGAAACGCATCGTGGGTGGAAACATGTCCTTG 3' 
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3' primer Notl site comes from E. co!i TOPO vector SEQ ID No. 44: 
5' ACTCAGGTGGCTACTTATCCCCTTCCTC 3' 

EXAMPLE 5 

Assays for identification of candidate compounds that modulate that activity of 
5 an MTSP 

Assay for identifying inhibitors 

The ability of test compounds to act as inhibitors of catalytic activity of 
an MTSP, including MTSP1, MTSP3, MTSP4, MTSP6 can be assessed in an 
amidolytic assay. The inhibitor-induced inhibition of amidolytic activity by a 

10 recombinant MTSP or the protease domain portions thereof, can be measured by 
IC50 values in such an assay. 

An exemplary assay buffer is HBSA (10 mM Hepes, 150mM sodium 
chloride, pH 7.4, 0.1% bovine serum albumin). All reagents were from Sigma 
Chemical Co. (St. Louis, MO), unless otherwise indicated. Two IC50 assays at 

15 30-minute (a 30-minute preincubation of test compound and enzyme) and at 
0-minutes (no preincubation of test compound and enzyme) are conducted. For 
the IC50 assay at 30-minute, the following reagents are combined in appropriate 
wells of a Corning microtiter plate: 50 microliters of HBSA, 50 microliters of the 
test compound, diluted (covering a broad concentration range) in HBSA (or 

20 HBSA alone for uninhibited velocity measurement), and 50 microliters of the 
MTSP or protease domain thereof diluted in buffer, yielding a final enzyme 
concentration of about 100-500 pM. Following a 30-minute incubation at 
ambient temperature, the assay is initiated by the addition of 50 microliters of a 
substrate for the particular MTSP (see, e.g., table and discussion below) and 

25 reconstituted in deionized water, followed by dilution in HBSA prior to the assay) 
were added to the wells, yielding a final volume of 200 microliters and a final 
substrate concentration of 300 //M (about 1.5-times Km). 

For an IC50 assay at 0-minute, the same reagents are combined: 50 
microliters of HBSA, 50 microliters of the test compound, diluted (covering the 

30 identical concentration range) in HBSA (or HBSA alone for uninhibited velocity 
measurement), and 50 microliters of the substrate, such as a chromogenic 
substrate. The assay is initiated by the addition of 50 microliters of MTSP. The 
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final concentrations of all components are identical in both IC50 assays (at 30- 
and 0-minute incubations). 

The initial velocity of the substrate hydrolysis is measured in both assays 
by, for example for a chromogenic substrate, as the change of absorbance at a 
5 particular wavelength, using a Thermo Max£ Kinetic Microplate Reader 

(Molecular Devices) over a 5 minute period, in which less than 5% of the added 
substrate was used. The concentration of added inhibitor, which caused a 50% 
decrease in the initial rate of hydrolysis was defined as the respective IC50 value 
in each of the two assays (30-and 0-minute). 

10 Another assay for identifying inhibitors 

Test compounds for inhibition of the protease activity of the protease 
domain of is assayed in Costar 96 well tissue culture plates (Corning NY). 
Approximately 2-3 nM the MTSP or protease domain thereof is mixed with 
varying concentrations of inhibitor in 29.2 mM Tris, pH 8.4, 29.2 mM imidazole, 

15 217 mM NaCI (100 mL final volume), and allowed to incubate at room 

temperature for 30 minutes. 400 mM substrate is added, and the reaction 
monitored in a SpectraMAX Plus microplate reader (Molecular Devices, 
Sunnyvale CA) by following the change in a parameter correlated with 
hydrolysis, such as absorbance for a chromogenic substrate for 1 hour at 37° C. 

20 

ASSAY FOR SCREENING MTSP6 

The protease domain of MTSP6 expressed in Pichia pastoris is assayed 
for inhibition by various compounds in Costar 96 well tissue culture plates 
(Corning NY). Approximately 1-20 nM MTSP6 is mixed with varying 

25 concentrations of inhibitor in 29.2 mM Tris, pH 8.4, 29.2 mM Imidazole, 217 

mM NaCI (100 pL final volume), and allowed to incubate at room temperature for 
30 minutes. 500 //M substrate Spectrozyme t-PA (American Diagnostica,, 
Greenwich, CT) is added, and the reaction is monitored in a SpectraMAX Plus 
microplate reader (Molecular Devices, Sunnyvale CA) by measuring the change 

30 in absorbance at 405 nm for 30 minutes at 37 °C. 
Identification of substrates 
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Particular substrates for use in the assays can be identified empirically by 
testing substrates. The following list of substrates are exemplary of those that 
can be tested. 



Substrate name 


Structure 


S 2366 


pyroGlu-Pro-Arg-pNA.HCI 


spectrozyme t-PA 


CH 3 S0 2 -D-HHT-Gly-Arg-pNA.AcOH 


N-p-tosyl-Gly-Pro-Arg-pNA 


N-p-tosyl-Gly-Pro-Arg-pNA 


Benzoyl- Val-G ly-Arg-pNA 


Benzoyl- Val-Gly-Arg-pIMA 


Pefachrome t-PA 


CH 3 S0 2 -D-HHT-Gly-Arg-pNA 


S 2765 


N-a-Z-D-Arg-Gly-Arg-pNA.2HCI 


S 2444 


pyroGlu-Gly-Arg-pNA.HCI 


S 2288 


H-D-lle-Pro-Arg-pNA.2HCI 


spectrozyme UK 


Cbo-L-(K)Glu(a-t-BuO)-Gly-Arg-pNA.2AcOH 


S 2302 


H-D-Pro-Phe-Arg-pNA.2HCI 


S 2266 


H-D-Val-Leu-Arg-pNA.2HCI 


S 2222 


Bz-lle-Glu(g-OR)-Gly-Arg-pNA.HCI 
R = H(50%) and R = CH 3 (50%) 


Chromozyme PK 


Benzoyl-Pro-Phe-Arg-pNA 


S 2238 


H-D-Phe-Pip-Arg-pNA.2HCI 


S 2251 


H-D-Val-Leu-Lys-pNA.2HCI 


Spectrozyme PI 


H-D-Nle-HHT-Lys-pNA.2AcOH 




Pyr-Arg-Thr-Lys-Arg-AMC 




H-Arg-Gln-Arg-Arg-AMC 




Boc-Gln-Gly-Arg-AMC 




Z-Arg-Arg-AMC 


Spectrozyme THE 


H-D-HHT-Ala-Arg-plMA.2AcOH 


Spectrozyme fXlla 


H-D-CHT-Gly-Arg-pNA.2AcOH 




CVS 2081-6 (MeSO r dPhe-Pro-Arg-pNA) 




Pefachrome fVlla {CH 3 S0 2 -D-CHA-But-Arg-pNA) 



10 



15 



20 



25 



pNA = para-nitranilide (chromogenic) 
AMC = amino methyl coumarin (fluorescent) 

If none of the above substrates are cleaved, a coupled assay, described 

above, can be used. Briefly, test the ability of the protease to activate and 

enzyme, such as plasminogen and trypsinogen. To perform these assays, the 

single chain protease is incubated with a zymogen, such as plasminogen or 

trypsinogen, in the presence of the a known substrate, such, lys-plasminogen. 



30 



35 
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for the zymogen. If the single chain activates the zymogen, the activated 
enzyme, such as plasmin and trypsin, will degrade the substrate therefor. 

EXAMPLE 6 

Isolation and Cloning of Matriptase 
5 A. Cell type and growth of cells 

Human prostate adenocarcinoma cell line, PC-3, was purchased from 
ATCC {catalog number CRL-1435; Manassas, VA). The cells were cultured at 
37°C, 5% C0 2 in Ham's F-12K growth medium (catalog number 9077; Irvine) 
supplemented with 2 mM L-glutamine and 10% fetal bovine serum. All 
10 subsequent cell manipulations were carried out according to the manufacturer's 
instructions. PC-3 cells were allowed to grow to about 90% confluence, and 
were then washed briefly with 1x phosphate buffered saline. 

B. Isolation of total RNA, and purification and enrichment of polyA + 

RNA 

15 PC-3 cells were lysed in Trizol reagent (catalog number 15596; Life 

Technologies, Rockville, MD) and total RNA was isolated according to the 
manufacturer's protocol. The concentration of total RNA was estimated from 
absorbance reading at 260 nm. Poly A* 4 " RNA was purified and enriched using 
oligo-dT beads (catalog number 70061; Oligotex, Qiagen, Valencia, CA). 

20 C. Reverse-transcription and polymerase chain reaction (PCR) 

PC-3-derived polyA + RNA was converted to single-stranded cDNA 
(sscDNA) by reverse transcription using ProSTAR first-strand RT-PCR kit (catalog 
number 200420; Stratagene, La Jolla, CA) and Superscript II RNase H* reverse 
transcriptase (catalog number 18064-022; Life Technologies). After reverse 

25 transcription, an aliquot of PC-3 sscDNA (4 //L) was subjected to PCR using 2 
mM each of the sense and anti-sense degenerate oligonucleotide primers a Q nd 
Taq polymerase (catalog number 201203; Qiagen). Total reaction volume was 
100)t/L. The sequence of the sense primer was 5'- 

TGGRT(l)VT(l)WS(l)GC(l)RC(l)CAYTG-3' (SEQ ID No. 13) and that of the anti- 
30 sense was 5'(l)GG(l)CC(l)CC(l)SWRTC(l)CCYT(l)RCA(l)GHRTC-3' (SEQ ID 

No. 14), where R = A,G; V = G,A,C; W = A,T; S = G,C; Y = C,T; H=A,T,C. The 
primer sequences correspond to two highly conserved regions in all 
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chymotrypsin-like serine proteases and amplify PCR products ranging from 
approximately 400 to 500 base pairs. 

D. Clone screening and sequencing 

The PCR products were separated on a 2% agarose gel and purified using 
5 a gel extraction kit (catalog number 28706; QIAquick gel extraction kit; Qiagen). 
The purified DNA fragments were ligated into pCR2.1-TOPO (catalog number 
K4500-01; Invitrogen, Carlsbad, CA). After transformation into E. cofi cells, 
plasmid DNA was isolated and analyzed by digestion with EcoRI restriction 
enzyme. Clones that had inserted nucleic acid were further characterized by 

10 sequencing using a fluorescent dye-based DNA sequencing method (catalog 
number 4303149; BigDye terminator cycle sequencing kit with AmpliTaQ DNA 
polymerase; Perkin Elmer, Lincoln, CA). A total of 31 clones were sequenced 
and analyzed. All sequences were analyzed by a multiple nucleotide sequence 
alignment algorithm (blastn) (www.ncbi.nlm.nih.gov/blast) to identify identical or 

15 closely related DNA deposited in GenBank (NCBI, Bethesda, MD). Those that did 
not show, significant homology were further analyzed using blastx, which 
compares the six-frame conceptual translation products of a nucleotide sequence 
(both strands) against a protein sequence database (SwissProt). Eight clones 
yielded identical cDNA fragments that encode MTSP1 . MTSP1 was 

20 subsequently found to be identical to matriptase (GenBank accession number 
AF1 18224). 

E. Rapid amplification of cDNA ends (RACE) and gene-specific 
amplification of MTSP1 

To obtain DNA encoding the complete protease domain of MTSP1, RACE 

25 and gene-specific amplification reactions were performed. A human prostate 

Marathon-Ready cDNA (catalog # 7418-1; Clontech) was used to isolate part of 

the cDNA encoding MTSP1 . Marathon-Ready cDNA is prepared to contain a 

known hybridization sequence at the 5' and 3' ends of the sscDNA. The 3' 

region of MTSP1 cDNA was obtained by a 3'-RACE reaction using a gene 

30 specific primer, 5'-CACCCCTTCTTCAATGACTTCACCTTCG-3' (SEQ ID No. 55). 

The 5' end of the MTSP1 protease domain was obtained by gene-specific 

amplification reaction using two MTSP1 -specific primers, 5'- 
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TACCTCTCCTACGACTCC-3' (SEQ ID No. 56) for the sense primer and 5'- 

GAGGTTCTCGCAGGTGGTCTGGTTG-3' (SEQ ID No. 57) for the antisense 

primer. The sequences for these two primers were obtained from the human 

SNC19 mRNA sequence. The 3'-RACE reaction and gene-specific PCR produced 

5 DNA fragments that were > 1 kbp in size. These fragments were subcloned into 

pCR2.1-TOPO (Invitrogen, San Diego, CA). After transformation into E. coli 

cells, plasmid DNA was isolated and analyzed by digestion with EcoRI restriction 

enzyme. Clones that had inserts were characterized by Southern blot analysis 

{using the internal cDNA fragment as probe) and by DNA sequence analysis. 

10 F. PCR amplification of cDNA encoding the protease domain of 

MTSP1 

To obtain a cDNA fragment encoding the entire protease domain of 
MTSP1, an end-to-end PCR amplification using gene-specific primers was used. 

The two primers used were: 5'- 

15 CTCGAGAAAAGAGTTGTTGGGGGCACGGATGCGGATGAG-3' (SEQ ID No. 58) 
for the 5' end and 5'-GCGGCCGCACTATACCCCAGTGTTCTCTTTGATCCA-3' 
{SEQ ID No. 36 for the 3' end. The 5' primer contained the sequence that 
encodes the start of the MTSP1 protease domain (WGGTDADE) (SEQ. ID. 
NO. 10). The 3' primer contained the stop codon of MTSP1 . A ~800-bp 

20 fragment was amplified, purified and subcloned into the Pichia pastoris 
expression vector, pPIC9K, resulting in pPIC9K-MTSP1 . 

G. Gene expression profile of MTSP1 in normal tissues, cancer ceils 
and cancer tissues 

To obtain information regarding the tissue distribution and gene 

25 expression level of MTSP1, the DNA insert from pPIC9K-MTSP1 was used to 

probe a blot containing RNA from 76 different human tissues (catalog number 

7775-1; human multiple tissue expression (MTE) array; CLONTECH, Palo Alto, 

CA). Significant expression was observed in the colon {ascending, transverse 

and descending), rectum, trachea, esophagus and duodenum. Moderate 

30 expression levels were observed in the jejunum, ileum, ilocecum, stomach, 

prostate, pituitary gland, appendix, kidney, lung, placenta, pancreas, thyroid 

gland, salivary gland, mammary gland, fetal kidney, and fetal lung. Lower 
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expression levels were seen in the spleen, thymus, peripheral blood leukocyte, 
lymph node, bone marrow, bladder, uterus, liver, adrenal gland, fetal heart, fetal 
liver, fetal spleen, and fetal thymus. A significant amount of the MTSP1 
transcript was also detected in colorectal adenocarcinoma cell line (SW480), 
5 Burkitt's lymphoma cell line (Daudi), and leukemia cell line (HL-60). RT-PCR of 
the MTSP1 transcript in several human primary tumors xenografted in athymic 
nude mice was performed using gene-specific primers. A high level of MTSP1 
transcript was detected in colon adenocarcinoma (CX-1) and pancreatic 
adenocarcinoma (GI-103). Moderate levels were observed in another colon 

10 adenocarcinoma (GI-112), ovarian carcinoma (GI-102), lung carcinoma (LX-1), 
and breast carcinoma (GI-101). Another lung carcinoma (GI-117) expressed a 
low level of the MTSP1 transcript. A similar RT-PCR was performed to detect 
the presence of the MTSP1 transcript in PC-3 and LNCaP cell lines. Both cell 
lines expressed significant amounts of MTSP1 transcript. 

15 H. Sequence analysis 

All derived DNA and protein sequences were analyzed using MacVector 
(version 6.5; Oxford Molecular Ltd., Madison, Wl). The cDNA encoding the 
protease domain of MTSP1 is composed of 726 base pairs which translate into a 
241-amino acid protein sequence (rMAP) (see SEQ ID No. 1 , 2, 49 and 50). 

20 EXAMPLE 7 

Production of Recombinant Serine Protease Domain of Matriptase or MTSP1 
(rMAP) 

A. Fermentation 

The production of multi-milligram amounts of rMAP was carried out by 
25 fermentation in a BioFlo 3000 fermentor (New Brunswick Scientific, NJ) 

equipped with a 3.3 L capacity bioreactor using a SMD1 1 68/pPIC9K:MTSP1 Sac 
SC1 clone. ZA001 complex media (10 g/L yeast extract, 20 g/L peptone, 40 g/L 
glycerol, 5 g/L ammonium sulfate, 0.2 g/L calcium sulfate dihydrate, 2 g/L 
magnesium sulfate heptahydrate, 2 g/L potassium sulfate, 25 g/L sodium 
30 hexametaposphate, 4.35 ml/L PTM1) was inoculated with 100 ml of an 

overnight culture of the P. pastoris transformant. The culture was supplemented 
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witb 50% glycerol by fed-batch phase and induced for 1 8-24 hours with 
methanol controlled at 0.025%. 

B. Purification of Recombinant Serine Protease Domain of Matriptase 
or W1TSP1 (rMAP) 

5 The rMAP was secreted into the culture medium, so the first step of the 

purification involved the removal of cells and cell debris by centrifugation at 
5000 g for 30 minutes. The resulting supernatant was decanted, adjusted to pH 
8.0 with 10 N NaOH, and filtered through a SartoBran 300 0.45 + 0.2 //M 
capsule. This supernatant was concentrated to 1 L by ultrafiltration using a 10 
10 kDa ultrafiltration cartridge (NC SRT UF system with AG/Technologies 

UFP-10-C-5A filter), and the buffer was exchanged by crossflow filtration into 
50 mM tris-HCI, 50 mM NaCI, 0.05% tween-80, pH 8.0 (buffer A). The 
filtration unit was rinsed once with 1 L buffer A which was combined with the 
concentrate. 

15 The concentrated rMAP-containing solution was passed over a 150 ml 

benzamidine column that had been equilibrated with buffer A, at a flow rate of 8 
ml/min. The column was washed with 3 column volumes of 50 mM tris-HCI, 
1.0 M NaCI, 0.05% tween-80, pH 8.0 (buffer B) and eluted with 3 column 
volumes of 50 mM tris-HCI, 1.0 M L-arginine, 0.05% tween-80, pH 8.0 (buffer 

20 C). Fractions containing rMAP were identified by activity assay and pooled. 

This pooled material was concentrated to 10 ml using a JumboSep concentrator 
(Pall Gelman) and a 10 kDa cutoff membrane. Once concentrated to 10 ml, the 
buffer was exchanged into 50 mM Na 2 HP0 4 , 125 mM NaCI, pH 5.5 (buffer D) 
and the volume adjusted to 5-10 ml. The retentate was removed and the 

25 concentrator washed with buffer D which was added to the concentrate- The 
total sample volume was adjusted 15 ml. 

The partially purified rMAP was passed through a 5 ml Q-sepharose Fast 
Flow HiTrap column (Amersham-Pharmacia Biotech) pre-equilibrated with 1 5 ml 
of buffer D. The flow through was collected. The HiTrap column was washed 

30 with an additional 10 ml of buffer D. Both flow throughs were pooled, and the 
protein concentration was determined by measurement of OD 280 (using an 
extinction coefficient of 2.012 mg/OD 280 ). Purified rMAP was then 
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deglycosylated by the addition 0.1 //I of Endoglycosidase H {ProZyme, 5 U/ml) 
per mg of protein and incubating overnight at 4°C with gentle swirling. 

The conductivity of the deglycosylated pool was adjusted to 2.0-3.0 
mS/cm with Nanopure H z O and the pH adjusted to 6.5 (-200-3OO mL final 
5 volume). The rMAP was then further purified by anion exchange 

chromatography by loading directly onto a Pharmacia Akta Explorer system using 
a 7 mL Source 15Q anion exchange column (Amersham-Pharmacia Biotech). 
The protein was eluted in a buffer containing 50 mM HEPES, pH 6.5 with a 0- 
0.33 M NaCI gradient over 10 column volumes at a flow rate of 6 ml/min. 
10 Fractions containing protein were pooled, and benzamidine was added to a final 
concentration of 10 mM. Protein purity was examined by SDS-PAGE and protein 
concentration determined by measurement of OD 280 and use of a theoretical 
extinction coefficient of 2.012 mg/OD 280 . 

EXAMPLE 8 

1 5 Assays 

Amidolytic Assay for Determining Inhibition of Serine Protease 
Activity of Matriptase or MTSP1 

The ability of test compounds to act as inhibitors of rMAP catalytic 
activity was assessed by determining the inhibitor-induced inhibition of 

20 amidolytic activity by the MAP, as measured by IC 50 values. The assay buffer 
was HBSA (10 mM Hepes, 150mM sodium chloride, pH 7.4, 0.1% bovine serum 
albumin). All reagents were from Sigma Chemical Co. (St. Louis, MO), unless 
otherwise indicated. 

Two IC 50 assays (a) one at either 30-minutes or 60-minutes (a 30-minute 

25 or a 60-minute preincubation of test compound and enzyme) and (b) one at 
0-minutes (no preincubation of test compound and enzyme) were conducted. 
For the IC S0 assay at either 30-minutes or 60-minutes, the following reagents 
were combined in appropriate wells of a Corning microtiter plate: 50 microliters 
of HBSA, 50 microliters of the test compound, diluted (covering a broad 

30 concentration range) in HBSA (or HBSA alone for uninhibited velocity 

measurement), and 50 microliters of the rMAP (Corvas International) diluted in 
buffer, yielding a final enzyme concentration of 250 pM as determined by active 



WO 01/57194 



PCT/US01/03471 



-168- 

site filtration. Following either a 30-minute or a 60-minute incubation at ambient 
temperature, the assay was initiated by the addition of 50 microliters of the 
substrate S-2765 (N-a-Benzyloxycarbonyl-D-arginyl-L-glycyl-L-arginine-p- 
nitroaniline dihydrochloride; DiaPharma Group, Inc.; Franklin, OH) to each well, 
5 yielding a final assay volume of 200 microliters and a final substrate 

concentration of 100 //M (about 4-times K m ). Before addition to the assay 
mixture, S-2765 was reconstituted in deionized water and diluted in HBSA. For 
the IC 50 assay at 0 minutes; the same reagents were combined: 50 microliters of 
HBSA, 50 microliters of the test compound, diluted (covering the identical 

10 concentration range) in HBSA (or HBSA alone for uninhibited velocity 

measurement), and 50 microliters of the substrate S-2765. The assay was 
initiated by the addition of 50 microliters of rMAP. The final concentrations of 
all components were identical in both IC 50 assays (at 30- or 60- and 0-minute). 
The initial velocity of chromogenic substrate hydrolysis was measured in 

15 both assays by the change of absorbance at 405 nM using a Thermo Max® 

Kinetic Microplate Reader (Molecular Devices) over a 5 minute period, in which 
less than 5% of the added substrate was used. The concentration of added 
inhibitor, which caused a 50% decrease in the initial rate of hydrolysis was 
defined as the respective IC 50 value in each of the two assays (30- or 

20 60-minutes and 0-minute). 

in vitro enzyme assays for specificity determination 
The ability of compounds to act as a selective inhibitor of matriptase 
activity was assessed by determining the concentration of test compound that 
"inhibits the activity of matriptase by 50%, (IC 50 ) as described in the above 

25 Example, and comparing IC 50 value for matriptase to that determined for all or 
some of the following serine proteases: thrombin, recombinant tissue 
plasminogen activator (rt-PA), plasmin, activated protein C, chymotrypsin, factor 
Xa and trypsin. 

The buffer used for all assays was HBSA (10 mM HEPES, pH 7.5, 150 
30 mM sodium chloride, 0.1 % bovine serum albumin). The assay for IC 50 

determinations was conducted by combining in appropriate wells of a Corning 
microtiter plate, 50 microliters of HBSA, 50 microliters of the test compound at 
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a specified concentration (covering a broad concentration range) diluted in HBSA 
(or HBSA alone for V 0 (uninhibited velocity) measurement), and 50 microliters of 
the enzyme diluted in HBSA. Following a 30 minute incubation at ambient 
temperature, 50 microliters of the substrate at the concentrations specified 
5 below were added to the wells, yielding a final total volume of 200 microliters. 
The initial velocity of chromogenic substrate hydrolysis was measured by the 
change in absorbance at 405 nm using a Thermo Max® Kinetic Microplate Reader 
over a 5 minute period in which less than 5% of the added substrate was used. 
The concentration of added inhibitor which caused a 50% decrease in the initial 

10 rate of hydrolysis was defined as the IC 50 value. 
Thrombin (f Ha) Assay 

Enzyme activity was determined using the chromogenic substrate, 
Pefachrome t-PA (CH 3 S0 2 -D-hexahydrotyrosine-glycyl-L-Arginine-p-nitroaniline, 
obtained from Pentapharm Ltd.). The substrate was reconstituted in deionized 

15 water prior to use. Purified human a-thrombin was obtained from Enzyme 

Research Laboratories, Inc. .The buffer used for all assays was HBSA (10 mM 
HEPES, pH 7.5, 150 mM sodium chloride, 0.1% bovine serum albumin). 

IC 50 determinations were conducted where HBSA (50 M-), a-thrombin (50 
//I) (the final enzyme concentration is 0.5 nM) and inhibitor (50 p\) {covering a 

20 broad concentration range), were combined in appropriate wells and incubated 
for 30 minutes at room temperature prior to the addition of substrate 
Pefachrome-t-PA (50 //I) (the final substrate concentration is 250 pM, about 5 
times Km). The initial velocity of Pefachrome t-PA hydrolysis was measured by 
the change in absorbance at 405 nm using a Thermo Max® Kinetic Microplate 

25 Reader over a 5 minute period in which less than 5% of the added substrate was 
used. The concentration of added inhibitor which caused a 50% decrease in the 
initial rate of hydrolysis was defined as the IC 50 value. 
Factor Xa 

Factor Xa catalytic activity was determined using the chromogenic 
30 substrate S-2765 (N-benzyloxycarbonyl-D-arginine-L-glycine-L-arginine-p-nitro- 
aniline), obtained from DiaPharma Group (Franklin, OH). All substrates were 
reconstituted in deionized water prior to use. The final concentration of S-2765 
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was 250 jjbA (about 5-times Km). Purified human Factor X was obtained "from 
Enzyme Research Laboratories, Inc. (South Bend, IN) and Factor Xa (FXa) was 
activated and prepared from it as described [Bock, P.E., Craig, P. A., Olson, S.T., 
and Singh, P. Arch. Biochem. Biophys. 273:375-388 (1989)]. The enzyme was 
5 diluted into HBSA prior to assay in which the final concentration was 0.25 nM. 
Recombinant tissue plasminogen activator (rt-PA) Assay 

rt-PA catalytic activity was determined using the substrate, Pefachrome 
t-PA (CH 3 S0 2 -D-hexahydrotyrosine-glYcyl-L-arginine-p-nitroanUine, obtained from 
Pentapharm Ltd.). The substrate was made up in deionized water followed by 
10 dilution in HBSA prior to the assay in which the final concentration was 500 
micromolar (about 3-times Km). Human rt-PA (Activase®) was obtained from 
Genentech Inc. The enzyme was reconstituted in deionized water and diluted 
into HBSA prior to the assay in which the final concentration was 1.0 nM. 
Plasmin Assay 

1 5 Plasmin catalytic activity was determined using the chromogenic 

substrate, S-2366 IL-pyroglutamyl-L-prolyl-L-arginine-p-nitroaniline 
hydrochloride], which was obtained from DiaPharma group. The substrate was 
made up in deionized water followed by dilution in HBSA prior to the assay in 
which the final concentration was 300 micromolar (about 2. 5-times Km). 

20 Purified human plasmin was obtained from Enzyme Research Laboratories, Inc. 
The enzyme was diluted into HBSA prior to assay in which the final 
concentration was 1 .0 nM. 

Activated Protein C (aPC) Assay 

aPC catalytic activity was determined using the chromogenic substrate, 
25 Pefachrome PC (delta-carbobenzloxy-D-lysine-L-prolyl-L-arginine-p-nitroaniline 

dihydrochloride), obtained from Pentapharm Ltd.). The substrate was made up 
in deionized water followed by dilution in HBSA prior to the assay in which the 
final concentration was 400 micromolar (about 3-times Km). Purified human 
aPC was obtained from Hematologic Technologies, Inc. The enzyme was diluted 
30 into HBSA prior to assay in which the final concentration was 1 .0 nM. 
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Chymotrypsin Assay 

Chymotrypsin catalytic activity was determined using the chromogenic 
substrate, S-2586 (methoxy-succinyl-L-arginine-L-prolyl-L-tyrosyl-p-nitroanilide), 
which was obtained from DiaPharma Group. The substrate was made up in 
5 deionized water followed by dilution in HBSA prior to the assay in which the final 
concentration was 100 micromolar (about 9-times Km). Purified (3X-crysta1lized; 
CDI) bovine pancreatic alpha-chymotrypsin was obtained from Worthington 
Biochemical Corp. The enzyme was reconstituted in deionized water and diluted 
into HBSA prior to assay in which the final concentration was 0.5 nM. 

10 Trypsin Assay 

Trypsin catalytic activity was determined using the chromogenic 
substrate, S-2222 (benzoyl-L-isoleucine-L-glutamic acid-lgamma-methyl ester]-L- 
arginine-p-nitroanilide), which was obtained from DiaPharma Group. The 
substrate was made up in deionized water followed by dilution in HBSA prior to 
15 the assay in which the final concentration was 250 micromolar (about 4-times 
Km). Purified (3X-crystallized; TRL3) bovine pancreatic trypsin was obtained 
from Worthington Biochemical Corp. The enzyme was reconstituted in deionized 
water and diluted into HBSA prior to assay in which the final concentration was 
0.5 nM. 

20 

Since modifications will be apparent to those of skill in this art, it is 
intended that this invention be limited only by the scope of the appended claims. 
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WHAT IS CLAIMED IS: 

1 . A substantially purified single chain polypeptide, comprising the 
protease domain of a type-ll membrane-type serine protease (MTSP) or a 
cataiytically active portion thereof, wherein: 

5 the MTSP portion of the protein consists essentially of the 

protease domain of the MTSP or a cataiytically active portion thereof. 

2. The substantially purified polypeptide of claim 1, wherein the 
MTSP is not expressed on endothelial cells. 

3. The substantially purified polypeptide of claim 1 , wherein the 
10 MTSP is not expressed on normal endothelial cells in vivo. 

4. The substantially purified polypeptide of claim 1 , wherein the 
MTSP is a human protein. 

5. The substantially purified polypeptide of claim 1 that consists 
essentially of the protease domain of an MTSP or a cataiytically active portion of 

15 the protease domain. 

6. The substantially purified polypeptide of claim 1 , wherein the 
expression and/or activity of the MTSP in tumor cells differs from its level of 
expression and/or activity in non-tumor cells. 

7. The substantially purified polypeptide of claim 1 , wherein the 
20 MTSP is detectable in a body fluid at a level that differs from its level in body 

fluids in a subject not having a tumor. 

8. The substantially purified polypeptide of claim 1 , wherein: 
the MTSP is present in a tumor; and 

a substrate or cof actor for the MTSP is expressed at levels that differ 
25 from a non-tumor cell in the same tissue. 

9. The substantially purified polypeptide of claim 1 , wherein: 

the MTSP exhibits altered substrate specificity in the tumor compared to its 
specificity in a non-tumor cell in the same tissue. 

10. The substantially purified polypeptide of claim 1, wherein the 
30 MTSP has an N-terminus that comprises IVNG, ILGG, VGLL or ILGG. 

11. The substantially purified polypeptide of claim 1, wherein the 
MTSP is selected from among MTSP1 , MTSP3, MTSP4 and MTSP6. 
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12. The substantially purified polypeptide of claim 1, wherein the 
protease domain comprises the sequence of amino acids set forth as amino acids 
615-855 of SEQ ID No. 2, as amino acids 205-437 of SEQ ID NO. 4, as the 
amino acids in SEQ ID No. 6, or as amino acids 217-443 in SEQ ID No. 12. 
5 13. The substantially purified polypeptide of claim 1 that has at least 

about 40%, 60%, 80%, 90% or 95% sequence identity with a protease domain 
that comprises the sequence of amino acids set forth as amino acids 61 5-855 of 
SEQ ID No. 2, as amino acids 205-437 of SEQ ID NO. 4, as the amino acids in 
SEQ ID No. 6, or as amino acids 217-443 in SEQ ID No. 12. 
10 14. A polypeptide of claim 1, wherein the protease domain portion is 

encoded by a nucleic acid molecule that hybridizes under conditions of high 
stringency along its full length to a nucleic acid molecule comprising a sequence 
of nucleotides set forth in SEQ ID No: 1, 3, 5, 7, 9 or 1 1 or to a molecule that 
encodes the protein set forth in SEQ ID No: 2, 4, 6 f 8, 10 or 1 2 or at least one 

15 domain thereof. 

15. A nucleic acid molecule, comprising a sequence of nucleotides that 

encodes the polypeptide of claim 1 . 

16. A mutein of the polypeptide of claim 1 , wherein: 

up to about 90% of the amino acids are replaced with another amino 

20 acid; 

and the resulting polypeptide is a single chain and has catalytic activity at 
least 10% of the unmutated polypeptide. 

17. The mutein of claim 16, wherein up to about 95% of the amino 

acids are replaced. 

25 18. The mutein of claim 16, wherein the resulting polypeptide is a 

single chain and has catalytic activity at least 50% of the unmutated 
polypeptide. 

19. A mutein of the polypeptide of claim 1 , wherein a free Cys in the 
protease domain is replaced with another amino acid, whereby the resulting 

30 polypeptide exhibits proteolytic activity. 

20. A mutein of the polypeptide of claim 1 , wherein a free Cys in the 
protease domain is replaced with a serine. 
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21. A vector comprising the nucleic acid molecule of claim 15. 

22. The vector of claim 21 that is an expression vector. 

23. The vector of claim 21 that includes a sequence of nucleotides 
that directs secretion of any protein encoded by a sequence of nucleotides 

5 operatively linked thereto. 

24. The vector of claim 21 that is a Pichia vector or an £. colt vector. 

25. A cell, comprising the vector of claim 21 . 

26. The cell of claim 25 that is a prokaryotic cell. 

27. The cell of claim 25 that is a eukaryotic cell. 

10 28. The cell of claim 25 that is selected from among a bacterial cell, a 

yeast cell, a plant cell, an insect cell and an animal cell. 

29. The cell of claim 25 that is a mammalian cell. 

30. A method for producing a polypeptide that contains a protease 
domain of an MTSP, comprising: 

15 culturing the cell of claim 25 under conditions whereby the encoded 

protein is expressed by the cell; and 

recovering the expressed protein. 

31 . The method of claim" 30, wherein the cell is a pichia cell and the 
protein is secreted into the culture medium. 

20 32 An antisense nucleic acid molecule that comprises at least 14 

contiguous nucleotides or modified nucleotides that are complementary to a 
contiguous sequence of nucleotides in the protease domain of an MTSP of claim 
1; or 

comprises at least 1 6 contiguous nucleotides or modified nucleotides that 
25 are complementary to a contiguous sequence of nucleotides in the protease 
domain of an MTSP of claim 1 . 

33. An antibody that specifically binds to the single chain form of a 
protease domain of the polypeptide of claim 1 , or a fragment or derivative of the 
antibody containing a binding domain thereof, wherein the antibody is a 
30 polyclonal antibody or a monoclonal antibody. 
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34. The polypeptide of claim 1 , wherein the MTSP is selected from 
among corin, MTSP1 , enterpeptidase, human airway trypsin-like protease (HAT), 
MTSP1 , TMPRSS2, and TMPRSS4. 

35. A conjugate, comprising: 
5 a) a protein of claim 1 , and 

b) a targeting agent linked to the protein directly or via a linker. 
142. The conjugate of claim 35, wherein the targeting agent permits 

i) affinity isolation or purification of the conjugate; 

ii) attachment of the conjugate to a surface; 
10 iii) detection of the conjugate; or 

iv) targeted delivery to a selected tissue or ceil. 

36. A combination, comprising: 

a) an inhibitor of the catalytic activity of the polypeptide of claim 1 ; 
and 

15 b) another treatment or agent selected from anti-tumor and anti- 

angiogenic treatments or agents. 

37. The combination of claim 36, wherein the inhibitor and the anti- 
tumor and/or anti-angiogenic agent are formulated in a single pharmaceutical 
composition or each is formulated in separate pharmaceutical compositions. 

20 38. The combination of claim 36, wherein the inhibitor is selected from 

antibodies and antisense oligonucleotides. 

39. A solid support comprising two or more polypeptides of claim 1 
linked thereto either directly or via a linker. 

40. The support of claim 39, wherein the polypeptides comprise an 

25 array. 

41 . The support of claim 39, wherein the polypeptides comprise a 
plurality of different MTSP protease domains, 

42. A method for identifying compounds that modulate the protease 
activity of an MTSP, comprising: 

30 contacting a polypeptide of claim 1 with a substrate proteolytically 

cleaved by the MTSP, and, either simultaneously, before or after, adding a test 
compound or plurality thereof; 
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measuring the amount of substrate cleaved in the presence of the test 
compound; and 

selecting compounds that change the amount cleaved compared to a 
control, whereby compounds that modulate the activity of the MTSP are 
5 identified. 

43. The method of claim 42, wherein the test compounds are small 
molecules, peptides, peptidomimetics, natural products, antibodies or fragments 
thereof. 

44. The method of claim 42, wherein a plurality of the test substances 
10 are screened simultaneously. 

45. The method of claim 42, wherein the change in the amount 
cleaved is assessed by comparing, the amount cleaved in the presence of the test 
compound with the amount in the absence of the test compound. 

46. The method of claim 42, wherein a plurality of the test substances 
15 are screened for simultaneously. 

47. The method of claim .42, wherein a plurality of the polypeptides 
are linked to a solid support, either directly or via a linker. 

48. The method of claim 42, wherein the polypeptides comprise an 

array. 

20 49. The method of claim 42, wherein the polypeptides comprise a 

plurality of different MTSP proteases. 

50. A method of identifying a compound that specifically binds to a 
single chain protease domain of an MTSP, comprising: 

contacting a polypeptide of claim 1 with a test compound or plurality 
25 thereof under conditions conducive to binding thereof; and 

identifying compounds that specifically bind to the MTSP single chain 
protease domain or compounds that inhibit binding of a compound known to 
bind to the MTPS single chain protease domain, wherein the known compound is 
contacted with the polypeptide before, simultaneously with or after the test 
30 compound. 

51. The method of claims 50, wherein the polypeptide is linked either 
directly or indirectly via a linker to a solid support. 
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52. The method of claim 50, wherein the test compounds are small 
molecules, peptides, peptidomimetics, natural products, antibodies or fragments 
thereof. 

53. The method of claim 50, wherein a plurality of the test substances 
5 are screened for simultaneously. 

54. The method of claim 51, wherein a plurality of the polypeptides 

are linked to a solid support. 

55. A substantially purified membrane-type serine protease 3 (MTSP3). 

56. The MTSP3 of claim 43 that is selected from the group consisting 

10 of: 

a polypeptide encoded by the sequence of nucleotides set forth in 

SEQ ID No. 3; 

a polypeptide encoded by a sequence of nucleotides that 
hybridizes under conditions of high stringency to the sequence of nucleotides set 

15 forth in SEQ ID No. 3; 

a polypeptide that comprises the sequence of amino acids set 

forth as amino acids 205-437 of SEQ ID No. 4; 

a polypeptide that comprises a sequence of amino acids having at 
least about 90% sequence identity with the sequence of amino acids set forth in 

20 SEQ ID No. 4; and 

a polypeptide encoded by a splice variant of the sequence of 

nucleotides set forth in SEQ ID No. 3. 

57. A nucleic acid molecule, comprising a sequence of nucleotides that 

encodes the polypeptide of claim 56. 
25 58. A vector comprising the nucleic acid molecule of claim 57. 

59. A cell, comprising the vector of claim 58. 

60. The cell of claim 59 that expresses the MTSP3 on its surface. 

61 . The cell of claim 59 that is a prokaryotic cell. 

62. The cells of claim 59 that is a eukaryotic cell. 

30 63. The cell of claim 59 that is selected from among a bacterial cell, a 

yeast cell, a yeast cell, a plant cell, an insect cell and an animal cell. 
64. The cell of claim 59 that is a mammalian cell. 
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65. The cell of claim 59, wherein the MTSP3 comprises the sequence 
of amino acids set forth in SEQ ID No. 4; or a sequence of amino acids encoded 
by a sequence of nucleotides that hybridizes under conditions of high stringency 
to the sequence of nucleotides set forth in SEQ ID No. 3; or a protein having at 

5 least about 90% sequence identity with the sequence of amino acids set forth in 
SEQ ID No. 4 and retaining protease activity. 

66. A method for producing an MTSP3, comprising: 

culturing the cell of claim 59 under conditions whereby the encoded 
protein is expressed by the cell; and 
10 recovering the expressed protein. 

67. An antisense nucleic acid molecule that comprises at least 14 
contiguous nucleotides or modified nucleotides that are complementary to a 
contiguous sequence of nucleotides in the nucleic acid molecule of claim 57; or 

comprises at least 16 contiguous nucleotides or modified nucleotides that 
15 are complementary to a contiguous sequence of nucleotides in the in the nucleic 

acid molecule of claim 57. 

68. An antibody that specifically binds to the MTSP of claim 57, or a 
fragment or derivative of the antibody containing a binding domain thereof, 
wherein the antibody is a polyclonal antibody or a monoclonal antibody. 

20 69. A method for treating or preventing a neoplastic disease, in a 

mammal, comprising administering to a mammal an effective amount of an 
inhibitor of an MTSP3 of claim 55. 

70. The method of claim 69, wherein the inhibitor is an antibody that 
specifically binds to the MTSP3, or a fragment or derivative of the antibody 

25 containing a binding domain thereof, wherein the antibody is a polyclonal 
antibody or a monoclonal antibody. 

71. A substantially purified membrane-type serine protease 4 (MTSP4) 

72. The substantially purified MTSP4 that is an MTSP4-L or an 
MTSP4-S. 

30 73. The MTSP4 of claim 71 that is selected from the group consisting 

of: 
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a polypeptide encoded by the sequence of nucleotides set forth in 
SEQ ID No. 7 or 9; 

a polypeptide encoded by a sequence of nucleotides that 
hybridizes under conditions of high stringency to the sequence of nucleotides set 
5 forth in SEQ ID No. 7 or 9; 

a polypeptide that comprises the sequence of amino acids set 
forth in SEQ ID No. 6, 8 or 10; and 

a polypeptide encoded by a splice variant of the sequence of 
nucleotides set forth in SEQ ID No. 7 or 9. 

10 

74. The MTSP4 of claim 73 that is an MTSP4-L or an MTSP4-S. 

75. A nucleic acid molecule, comprising a sequence of nucleotides that 
encodes the polypeptide of claim 73. 

76. A vector comprising the nucleic acid molecule of claim 74. 
15 77. A cell, comprising the vector of claim 76. 

78. The cell of claim 77 that expresses the MTSP4 on its surface. 

79. The cell of claim 77 that is a prokaryotic cell. 

80. The cells of claim 77 that is a eukaryotic cell. 

81 . The cell of claim 77 that is selected from among a bacterial cell, a 
20 yeast cell, a yeast cell, a plant cell, an insect cell and an animal cell. 

82. The cell of claim 77 that is a mammalian cell. 

83. The cell of claim 77, wherein the MTSP4 comprises the sequence 
of amino acids set forth in SEQ ID No. 6, 8 or 10; or a sequence of amino acids 
encoded by a sequence of nucleotides that hybridizes under conditions of high 

25 stringency to the sequence of nucleotides set forth in SEQ ID No. 7 or 9; or a 

protein having at least about 90% sequence identity with the sequence of amino 
acids set forth in SEQ ID No. 8 or 10 and retaining protease activity. 

84. A method for producing an MTSP4, comprising: 

culturing the cell of claim 77 under conditions whereby the encoded 
30 protein is expressed by the cell; and 

recovering the expressed protein. 
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85. An antisense nucleic acid molecule that comprises at least 14 
contiguous nucleotides or modified nucleotides that are complementary to a 
contiguous sequence of nucleotides in the nucleic acid molecule of claim 75 or 

comprises at least 1 6 contiguous nucleotides or modified nucleotides that 
5 are complementary to a contiguous sequence of nucleotides in the nucleic acid 

molecule of claim 75. 

86. An antibody that specifically binds to the MTSP of claim 72, or a 
fragment or derivative of the antibody containing a binding domain thereof, 
wherein the antibody is a polyclonal antibody or a monoclonal antibody. 

10 87. An antibody that specifically binds to the MTSP of claim 73, or a 

fragment or derivative of the antibody containing a binding domain thereof, 
wherein the antibody is a polyclonal antibody or a monoclonal antibody. 

88. A method for treating or preventing a neoplastic disease, in a 
mammal, comprising administering to a mammal an effective amount of an 

15 inhibitor of an MTSP4 of claim 71 . 

89. The method of claim 88, wherein the inhibitor is an antibody that 
specifically binds to an MTSP4, or a fragment or derivative of the antibody 
containing a binding domain thereof, wherein the antibody is a polyclonal 
antibody or a monoclonal antibody. 

20 90. A substantially purified membrane-type serine protease 6 (MTSP6) 

selected from the group consisting of: 

a polypeptide encoded by the sequence of nucleotides set forth in 

SEQ ID No. 1 1; 

a polypeptide encoded by a sequence of nucleotides that 

25 hybridizes along the full length thereof under conditions of high stringency to the 
sequence of nucleotides set forth in SEQ ID No. 1 1 ; 

a polypeptide that comprises the sequence of amino acids set 
forth as amino acids 217-443 of SEQ ID No. 1 2; 

a polypeptide encoded by a splice variant of the sequence of 

30 nucleotides set forth in SEQ ID No. 1 1 . 

91. A nucleic acid molecule, comprising a sequence of nucleotides that 

encodes the polypeptide of claim 90. 
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92. A vector comprising the nucleic acid molecule of claim 91 . 

93. A cell, comprising the vector of claim 92. 

94. The cell of claim 93 that expresses the MTSP6 on its surface. 

95. The cell of claim 93 that is a prokaryotic cell. 
5 96. The cells of claim 93 that is a eukaryotic cell. 

97. The cell of claim 93 that is selected from among a bacterial cell, a 
yeast cell, a yeast cell, a plant cell, an insect cell and an animal cell. 

98. The cell of claim 93 that is a mammalian cell. 

99. The cell of claim 93, wherein the MTSP6 comprises the sequence 
10 of amino acids set forth in SEQ ID No. 12; or a sequence of amino acids 

encoded by a sequence of nucleotides that hybridizes along the full length 
thereof under conditions of high stringency to the sequence of nucleotides set 
forth in SEQ ID No. 1 1; or a protein having at least about 95% sequence identity 
with the sequence of amino acids set forth in SEQ ID No. 1 2 and retaining 
15 protease activity. 

100. A method for producing an MTSP6, comprising: 

culturing the cell of claim 93 under conditions whereby the encoded 
protein is expressed by the cell; and 

recovering the expressed protein. 
20 101. An antisense nucleic acid molecule that comprises at least 14 

contiguous nucleotides or modified nucleotides that are complementary to a 
contiguous sequence of nucleotides in the nucleic acid molecule of claim 91; or 

comprises at least 1 6 contiguous nucleotides or modified nucleotides that 
are complementary to a contiguous sequence of nucleotides in the in the nucleic 
25 acid molecule of claim 91 . 

102. An antibody that specifically binds to the MTSP of claim 90, or a 
fragment or derivative of the antibody containing a binding domain thereof, 
wherein the antibody is a polyclonal antibody or a monoclonal antibody. 

103. A method for treating or preventing a neoplastic disease, in a 
30 mammal, comprising administering to a mammal an effective amount of an 

inhibitor of an MTSP6 of claim 90. 
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104. The method of claim 103, wherein the inhibitor is an antibody that 
specifically binds to the MTSP6, or a fragment or derivative of the antibody 
containing a binding domain thereof, wherein the antibody is a polyclonal 
antibody or a monoclonal antibody. 
5 105. A recombinant non-human animal, wherein an endogenous gene of 

an MTSP has been deleted or inactivated by homologous recombination or 
insertional mutagenesis of the animal or an ancestor thereof. 

106. A recombinant non-human animal of claim 105, wherein the MTSP 
is an MTSP1 , MTSP3, MTSP4 or MTSP6. 
10 107. A conjugate, comprising: 

a) an MTSP3 or MTSP4 or an MTSP6 of claim 90; and 

b) a targeting agent linked to the protein directly or via a linker. 

108. The conjugate of claim 106, wherein the targeting agent permits 
i) affinity isolation or purification of the conjugate; 

•j 5 jj) attachment of the conjugate to a surface; 

iii) detection of the conjugate; or 

iv) targeted delivery to a selected tissue or cell. 

109. A combination, comprising: 

a) an inhibitor of the catalytic activity of an MTSP3 or MTSP4 or 
20 MTSP6 of claim 90; and 

b) another treatment or agent selected from anti-tumor and anti- 
angiogenic treatments or agents. 

1 10. The combination of claim 109, wherein the inhibitor and the anti- 
tumor and/or anti-angiogenic agent are formulated in a single pharmaceutical 
25 composition or each is formulated in separate pharmaceutical compositions. 

111. The combination of claim 109, wherein the inhibitor is selected 
from antibodies and antisense oligonucleotides. 

112. A solid support comprising two or more MTSP3 polypeptides 
and/or MTSP4 polypeptides and/or MTSP6 polypeptides of claim 90 linked 

30 thereto either directly or via a linker. 

1 13. The support of claim 112, wherein the polypeptides comprise an 

array. 
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114. A method for identifying compounds that modulate th protease 
activity of an MTSP selected from MTSP3, MTSP4 or MTSP6 of claim 90, 
comprising: 

contacting the MTSP with a substrate proteolytically cleaved by the 
5 MTSP, and, either simultaneously, before or after, adding a test compound or 
plurality thereof; 

measuring the amount of substrate cleaved in the presence of the test 
compound; and 

selecting compounds that change the amount cleaved compared to a 
1 0 control, whereby compounds that modulate the activity of the MTSP are 
identified. 

115. The method of claim 114, wherein the test compounds are small 
molecules, peptides, peptidomimetics, natural products, antibodies or fragments 
thereof. 

15 117. The method of claim 1 14, wherein the change in the amount 

cleaved is assessed by comparing the amount cleaved in the presence of the test 
compound with the amount in the absence of the test compound. 

1 1 8. The method of claim 114, wherein a plurality of the test 
substances are screened for simultaneously. 
20 119. The method of claim 1 18, wherein a plurality of the polypeptides 

are linked to a solid support. 

1 20. A method of identifying a compound that specifically binds to a 
MTSP selected from MTSP3, MTSP4 and the MTSP6 of claim 90, comprising: 
contacting the MTSP polypeptide with a test compound or plurality 
25 thereof under conditions conducive to binding thereof; and 

identifying compounds that specifically binds to the MTSP. 
121. A method of identifying a compound that specifically binds to a 
MTSP selected from MTSP3, MTSP4 and the MTSP6 of claim 90, comprising: 
contacting the MTSP polypeptide with a test compound or plurality 
30 thereof under conditions conducive to binding thereof; and 
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identifying compounds that specifically binds to the MTSP. 

122. The method of any claims 121, wherein the polypeptide is linked 
either directly or indirectly via a linker to a solid support. 

123. The method of claim 121, wherein the test compounds are small 
5 molecules, peptides, peptidomimetics, natural products, antibodies or fragments 

thereof. 

124. The method of claim 121, wherein a plurality of the test 
substances are screened for simultaneously. 

125. The method of claim 124, wherein a plurality of the polypeptides 
10 are linked to a solid support. 

126. An MTSP6 polypeptide, comprising amino acids set forth as amino 
acids 46-55 in SEQ ID Mo. 12 and/or amino acids 368-394 of SEQ ID No. 12, 
and that is encoded by a sequence of nucleic acids that hybridizes under 
moderate stringency to nucleic acid encoding the polypeptide set forth in SEQ ID 

15 No. 12. 

127. The polypeptide of claim 126 that comprises the amino acids set 
forth as amino acids 46-55 in SEQ ID No. 12 and/or amino acids 368-394 of 
SEQ ID No. 1 2, and that is encoded by a sequence of nucleic acids that 
hybridizes under high stringency along its full length or full length of the protease 

20 domain to nucleic acid encoding the polypeptide set forth in SEQ ID No. 12. 

128. The polypeptide of claim 126, comprising the sequence of amino 
acids set forth in SEQ ID No. 12. 

129. An isolated nucleic acid molecule, comprising a sequence of 
nucleic acids that encodes the polypeptide of claim 126. 

25 1 30. A method for treating tumors, comprising administering a prodrug 

that is specifically cleaved by an MTSP, whereby, upon contact with a cell that 
expresses MTSP activity, the prodrug is converted into an active drug. 

131 . The method of claim 130, wherein the MTSP is selected from 
among an MTSP3, MTSP4 and an MTSP6 of claim 90. 

30 132. The method of claim 130, wherein the MTSP is selected from 

among corin, enterpeptidase, human airway trypsin-like protease (HAT), MTSP1, 
TMPRSS2, and TMPRSS4. 
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133. A method of detecting neoplastic disease, comprising: detecting an 
MTSP3, MTSP4 or MTSP6 of claim 90 in a biological sample, wherein the 
amount detected differs from the amount in a subject who does not have 
neoplastic disease. 

5 134. The method of claim 133, wherein the biological sample is 

selected from the group consisting of blood, urine, saliva, tears, interstitial fluid, 
cerebrospinal fluid, ascites fluid, tumor tissue biopsy and circulating tumor cells. 

135. The method of claim 133, wherein the extracellular domain of the 
MTSP3, MTSP4 or MTSP6 is in the sample. 
10 136. A modulator of the activity of a MTSP, which modulator is 

identified by the method of any of claims 42 and 114. 

137. Use of the polyeptide of claim 1 for identifying compounds that 
modulate the activity of an MTSP. 

138. The use of claim 137, wherein the compounds inhibit the 
15 proteolytic acitivity thereof . 

1 39 The use of claim 1 37, wherein the MTSP is selected from among 
MTSP1 , MTSP3, MTSP4, MTSP6, corin, enterpeptidase, human airway trypsin- 
like protease (HAT), TMPRSS2, and TMPRSS4. 

140. Use of a polypeptide of any of claims 55, 71 and 90 for treatment 
20 of a neoplastic disease. 

141 . Use of a polypeptide of claim 55, 71 and 90 for formulation of a 
medicament of treatment of neoplastic disease. 
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V 

MTSP3 194 LACGKS LKTPRWGGEEASVDS WPWQVS IQYDKQHVCGGS I LD 236 

20S 

MTSP4-S 396 PQCDGRPDCRDGSDEEHCECGLQGPSSRIVGGAVSSEGEWPWQASLQVRGRHICGGALIA 455 

424 

MTSP4-L 540 PQCDGRPDCRDGSDEEHCECGLQGPSSRIVGGAVSSEGEWPWQASLQVRGRHIOK3ALIA 5 99 

568 

MTSP6 205 TACGHR RGYSSRIVGGNMSLLSQWPWQASLQFQGYHLCGGSVIT 248 

217 + 

MTSP3 237 PHWVLTAAHCFRKHTDVFN- -WKVRAGSDKLGS FPSLAVAKI 1 1 IEFNPMYPKDND 2 90 

MTSP4-S 456 DRWITAAHCFQEDSMASTVLWTVFIjGKVWQNSRWPGEVSPKVSRLLLHPYHEEDSHDYD 515 
MTSP4-L 600 DRWVITAAHCFQEDSMASTVLWTVFIiGKVWQNSRWPGEVSFKVSRLLiIjHPYHEEDSHDyD 718 
MTSP6 249 PLWIITAAHCVYDIiYIjPKS- -WTIQVGLVSLliD- -NPAPSHLVEKIVYHSKYKPKRLGND 304 

MTSP3 291 IALMKLQFPLTFSGTVRPICLPFFDEELTPATPLWIIGWGFTKQNGGKMSDILLQASVQV 3 50 

MTSP4-S 516 VALLQLDHPWRSAAVRPVCLPARSHFFEPGLHCWITGWGALRE-GGPISNALQKVDVQL 574 

MTSP4-L 660 VALLQLDHPWRSAAVRPVCLPARSHFFEPGLHCWITGWGALRE-GGPISNALQKVDVQL 718 

* 

MTSP6 3 05 lAIiMKLAGPLTFNEMIQPVCLPNSEEN'FPDGKVCWTSGWGATED-GGDASPVLNHAAVPIi 363 

★ 

MTSP3 351 IDSTRCNADDAYQGEVTEKMMCAGIPEGGVDTCQGDSGGPliMYQSDQ- -WHWGIVSWGY 408 

MTSP4-S 575 IPQDLCS* -EVYRYQWPRMLCAGYRKGKKDACQGDSGGPLVCKALSGRWFLAGLVSWGL 63 2 
MTSP4-L 719 IPQDIiCS--EVYRYQVTPRMLCAGYRKGKKDACQGDSGGPLVCKALSGRWFI*AGLVSWGL 77 6 
MTSP6 364 I SNKICNHRDVYGG 1 I SPSMLCAGYLTGGVDSCQGDSGPLVCQERR - LWKVL VGATS FG I 442 

MTSP3 409 GCGGPSTPGVYTKVSAYLNWIYNVWKAEL 437 

MTSP4-S 633 GCGRPNYFGVYTRITGVISWIQQVVT 658 

MTSP4-L • 777 GCGRPNYFGVYTRITGVISWIQQWT 8 02 

MTSP6 423 GCAEVNKPGVYTRVTSFLDWIHEQMERDLiKT 4 53 

^ cleavage site 

+ potential glycosylation site 

* unpaired cysteine 

FIG. 4 
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SEQUENCE LISTING 



<110> Edwin L. Madison 
Edgar O. Ong 
Jiunn-Chem Yeh 
Corvas International, Inc. 



<12 0> NUCLEIC ACID MOLECULES ENCODING 

TRANSMEMBRANE SERINE PROTEASES , THE ENCODED PROTEINS AND 
METHODS BASED THEREON 

<130> 24745-1607 

<140> 09/000,000 
<141> 2001-02-01 



<150> 60/213,124 
<151> 2000-06-22 

<150> 60/234,840 
<151> 2000-06-22 

<150> 60/179,982 
<151> 2000-02-03 

<150> 60/183,542 
<;151> 2000-02-18 

<150> 09/657,968 
<151> 2000-02-08 

<160> 72 

<170> FastSEQ for Windows Version 4.0 



<210> 1 

<211> 3147 

<212> DNA 

<213> Homo Sapien 



<220> 

<223> Nucleotide encoding MTSP1 



<221> CDS 

<222> (23) . . . (2589) 
<300> 

<301> O'Brien, T.J. and Tanimoto, H. 
<308> GenBank AR081724 
<310> US Pat 5972616 
<311> 1998-02-20 
<312> 1999-10-26 



<400> 1 

tcaagagcgg cctcggggta cc atg ggg age gat egg gec cgc aag ggc gga 

Met Gly Ser Asp Arg Ala Arg Lys Gly Gly 
15 10 

ggg ggc ccg aag gac ttc ggc gcg gga etc aag tac aac tec egg cac 
Gly Gly Pro Lys Asp Phe Gly Ala Gly Leu Lys Tyr Asn Ser Arg His 

15 20 25 



52 



100 
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340 



nifi f t 3 ? * H? c ttg gag aaa aac 9 ta aa 9 ttc ctg cca gtc aac 148 
Glu Lys Val Asn Gly Leu Glu Glu Gly Val Glu Phe Leu Pro Val Asn 

30 35 40 

aac gtc aag aag gtg gaa aag cat ggc ccg ggg cgc tgg gtg gtg eta 196 
Asn Val Lys Lys Val Glu Lys His Gly Pre- Gly Arg Tr^ Val Val Leu 
45 50 55 

Ala f?f ^? ?? C 2?° ? tC T CtC tt9 9tC ttg Ctg gaa atc aac ttc 244 

Ala Ala Val Leu lie Gly Leu Leu Leu Val Leu Leu Gly lie Gly Phe 

60 65 70 

ctg gtg tgg cat ttg cag tac egg gac gtg cgt gtc cag aag gtc ttc 292 
Leu Val Trp His Leu Gin Tyr Arg Asp Val Arg Val Gin Lys Val Phe 
75 80 8 5 * 9 o 

aat ggc tac atg agg atc aca aat gag aat ttt gtg gat gec tac qaq 
Asn Gly Tyr Met Arg He Thr Asn Glu Asn Phe Val Asp Ala Tyr GlS 

S> 5 100 105 

aac tec aac tec act gag ttt gta age ctg gee age aag gtg aag gac 
Asn Ser Asn Ser Thr Glu Phe Val' Ser Leu Ala Ser Lys Val Lys Asp 

HO us 120 

gcg ctg aag ctg ctg tac age gga gtc cca ttc ctg ggc ccc tac cac 43 6 

Ala Leu Lys Leu Leu Tyr Ser Gly Val Pro Phe Leu Gly Pro Tyr His 
"5 130 135 

aag gag teg get gtg acg gec ttc age gag ggc age gtc atc gec tac 484 
Lys Glu Ser Ala Val Thr Ala Phe Ser Glu Gly Ser Val He Ala Tyr 
140 145 i 50 

tac tgg tct gag ttc age atc ccg cag cac ctg gtg gag gag gee gaa 532 
Tyr Trp Ser Glu Phe Ser He Pro Gin His Leu Val Glu Glu Ala GlS 
155 160 165 170 

cgc gtc atg gee gag gag cgc gta gtc atg ctg ccc ccg egg gcg cgc 
Arg Val Met Ala Glu Glu Arg Val Val Met Leu Pro Pro Arg Ala Arg 

175 180 185 



388 



580 



tec ctg aag tec ttt gtg gtc acc tea gtg gtg get ttc ccc acg gac 628 
Ser Leu Lys Ser Phe Val Val Thr Ser Val Val Ala Phe Pro Thr Asp 

190 195 200 

tec aaa aca gta cag agg acc cag gac aac age tgc age ttt ggc ctg 676 
Ser Lys Thr Val Gin Arg Thr Gin Asp Asn Ser Cys Ser Phe Gly Leu 
205 210 215 

cac gee cgc ggt gtg gag ctg atg cgc ttc acc acg ccc ggc ttc cct 724 
Hxs Ala Arg Gly Val Glu Leu Met Arg Phe Thr Thr Pro Gly Phe Pro 
220 225 230 

gac age ccc tac ccc get cat gec cgc tgc cag tgg gee ctg egg ggg 772 
Asp Ser Pro Tyr Pro Ala His Ala Arg Cys Gin Trp Ala Leu Arg Gly 
235 240 245 250 

gac gec gac tea gtg ctg age etc acc ttc cgc age ttt gac ctt acq 820 
Asp Ala Asp Ser Val Leu Ser Leu Thr Phe Arg Ser Phe Asp Leu Ala 

255 260 265 

o° C 1 9 ° ? aC S? 9 cac agc aac aac ct 9 Stg acg gtg tac aac acc ctg 868 
Ser Cys Asp Glu Arg Gly Ser Asp Leu Val Thr Val Tyr Asn Thr Leu 
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270 275 280 

age ccc atg gag ccc cac gec ctg gtg cag ttg tgt ggc acc tac cct 
Ser Pro Met Glu Pro His Ala Leu Val Gin Leu Cys Gly Thr Tyr Pro 
285 290 295 

ccc tec tac aac ctg acc ttc cac tec tec cag aac gtc ctg etc ate 
Pro Ser Tyr Asn Leu Thr Phe His Ser Ser Gin Asn Val Leu Leu He 
300 305 310 



gtc tgc gac agt gtg aac gac tgc gga gac aac age gac gag cag ggg 
Val Cys Asp Ser Val Asn Asp Cys Gly Asp Asn Ser Asp Glu Gin Gly 

510 515 520 



916 



964 



aca ctg ata acc aac act gag egg egg cat ccc ggc ttt gag gee acc 1012 
Thr Leu lie Thr Asn Thr Glu Arg Arg His Pro Gly Phe Glu Ala Thr 
315 320 325 330 

ttc ttc cag ctg cct agg atg age age tgt gga ggc cgc tta cgt aaa 
Phe Phe Gin Leu Pro Arg Met Ser Ser Cys Gly Gly Arg Leu Arg Lys 

335 340 345 



1060 



1252 



1300 



gee cag ggg aca ttc aac age ccc tac tac cca ggc cac tac cca ccc 1108 
Aia Gin Gly Thr Phe Asn Ser Pro Tyr Tyr Pro Gly His Tyr Pro Pro 

350 355 360 

aac att gac tgc aca tgg aac att gag gtg ccc aac aac cag cat gtg 1156 
Asn He Asp Cys Thr Trp Asn He Glu Val Pro Asn Asn Gin Has Val 
365 370 375 

aag gtg age ttc aaa ttc ttc tac ctg ctg gag ccc ggc gtg cct gcg 1204 
Lys Val Ser Phe Lys Phe Phe Tyr Leu Leu Glu Pro Gly Val Pro Ala 
380 385 390 

qqc acc tgc ccc aag gac tac gtg gag ate aat ggg gag aaa tac tgc 
Glv Thr Cys Pro Lys Asp Tyr Val Glu He Asn Gly Glu Lys Tyr Cys 
395 400 405 410 

gga gag agg tec cag ttc gtc gtc acc age aac age aac aag ate aca 
Glv Glu Arg Ser Gin Phe Val Val Thr Ser Asn Ser Asn Lys He Thr 
* 415 420 425 

gtt cgc ttc cac tea gat cag tec tac acc gac acc ggc ttc tta get 
Val Arg Phe His Ser Asp Gin Ser Tyr Thr Asp Thr Gly Phe Leu Ala 

430 435 440 

gaa tac etc tec tac gac tec agt gac cca tgc ccg ggg cag ttc acg 
Glu Tyr Leu Ser Tyr Asp Ser Ser Asp Pro Cys Pro Gly Gin Phe Thr 
445 450 455 

tgc cgc acg ggg egg tgt ate egg aag gag ctg cgc tgt gat ggc tgg 
Cys Arg Thr Gly Arg Cys He Arg Lys Glu Leu Arg Cys Asp Gly Trp 
460 465 470 

gec gac tgc acc gac- cac age gat gag etc aac tgc agt tgc gac gec 1492 
Ala Asp Cys Thr Asp His Ser Asp Glu Leu Asn Cys Ser Cys Asp Ala 
475 480 485 490 

ggc cac cag ttc acg tgc aag aac aag ttc tgc aag ccc etc ttc tgg 
Gly His Gin Phe Thr Cys Lys Asn Lys Phe Cys Lys Pro Leu Phe Trp 

495 500 505 



1348 



1396 



1444 



1540 



1588 
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tgc agt tgt ccg gcc cag acc ttc agg tgt tec aat ggg aag tgc etc 1636 

Cys Ser Cys Pro Ala Gin Thr Phe Arg Cys Ser Asn Gly Lys Cys Leu 

525. 530 535 

teg aaa age cag cag tgc aat ggg aag gac gac tgt ggg gac ggg tec 1684 

Ser Lys Ser Gin Gin Cys Asn Gly Lys Asp Asp Cys Gly Asp Gly Ser 

540 545 550 

gac gag gcc tec tgc ccc aag gtg aac gtc gtc act tgt acc aaa cac 1732 

Asp Glu Ala Ser Cys Pro Lys Val Asn Val Val Thr Cys Thr Lys His 

555 560 565 570 



acc tac cgc tgc etc aat ggg etc tgc ttg age aag ggc aac cct gag 
Thr Tyr Arg Cys Leu Asn Gly Leu Cys Leu Ser Lys Gly Asn Pro Glu 

575 580 585 



ctg gtc tct gcc gca cac tgc tac ate gat gac aga gga ttc agg tac 

Leu Val Ser Ala Ala His Cys Tyr lie Asp Asp Arg Gly Phe Arg Tyr 

655 660 665 

tea gac ccc acg cag tgg acg gcc ttc ctg ggc ttg cac gac cag age 

Ser Asp Pro Thr Gin Trp Thr Ala Phe Leu Gly Leu His Abp Gin Ser 

670 675 680 



tec cac ccc ttc ttc aat gac ttc acc ttc gac tat gac ate gcg ctg 

Ser His Pro Phe Phe Asn Asp Phe Thr Phe Asp Tyr Asp lie Ala Leu 

700 705 710 

ctg gag ctg gag aaa ccg gca gag tac age tec atg gtg egg ccc ate 

Leu Glu Leu Glu Lys Pro Ala Glu Tyr Ser Ser Met Val Arg Pro lie 

715 720 725 730 

tgc ctg ccg gac gcc tec cat gtc ttc cct gcc ggc aag gcc ate tgg 

Cys Leu Pro Asp Ala Ser His Val Phe Pro Ala Gly Lys Ala lie Trp 

735 740 745 

gtc acg ggc tgg gga cac acc cag tat gga ggc act ggc gcg ctg ate 
Val Thr Gly Trp Gly His Thr Gin Tyr Gly Gly Thr Gly Ala Leu lie 

750 755 760 



1780 



tgt gac ggg aag gag gac tgt age gac ggc tea gat gag aag gac tgc 1828 
Cys Asp Gly Lys Glu Asp Cys Ser Asp Gly Ser Asp Glu Lys Asp Cys 

590 595 600 

gac tgt ggg ctg egg tea ttc acg aga cag get cgt gtt gtt ggg ggc 1876 
Asp Cys Gly Leu Arg Ser Phe Thr Arg Gin Ala Arg Val Val Gly Gly 
605 610 615 

acg gat gcg gat gag ggc gag tgg ccc tgg cag gta age ctg cat get 1924 
Thr Asp Ala Asp Glu Gly Glu Trp Pro Trp Gin Val Ser Leu His Ala 
620 625 630 

ctg ggc cag ggc cac ate tgc ggt get tec etc ate tct ccc aac tgg 1972 
Leu Gly Gin Gly His lie Cys Gly Ala Ser Leu lie Ser Pro Asn Trp 
635 640 645 650 



2020 



2068 



cag cgc age gcc cct ggg gtg cag gag cgc agg etc aag cgc ate ate 2116 
Gin Arg Ser Ala Pro Gly Val Gin Glu Arg Arg Leu Lys Arg lie lie 
685 690 695 



2164 



2212 



2260 



2308 



ctg caa aag ggt gag ate cgc gtc ate aac cag acc acc tgc gag aac 2356 
Leu Gin Lys Gly Glu lie Arg Val lie Asn Gin Thr Thr Cys Glu Asn 
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765 770 775 

etc ctg ccg cag cag ate acg ccg cgc atg atg tgc gtg ggc ttc etc 

Leu Leu Pro Gin Gin lie Thr Pro Arg Met Met Cys Val Gly Phe Leu 

780 785 790 



<400> 2 
























Phe 


Met 


Gly 


Ser 


Asp 


Arg 


Ala Arg 


Lys 


Gly Gly Gly Gly 


Pro 


Lys 


Asp 


1 




5 










10 






15 




Gly 


Ala 


Gly 


Leu 


Lys 


Tyr 


Asn 


Ser 


Arg 


His Glu Lys 


Val 


Asn 


Gly 


Leu 




20 








25 






30 


Val 


Glu 


Glu 


Glu 


Gly 


Val 


Glu 


Phe 


Leu 


Pro 


Val 


Asn Asn Val 


Lys 


Lys 






35 










40 






45 




He 


Gly 


Lys 


His 


Gly 


Pro 


Gly Arg 


Trp 


Val 


Val 


Leu Ala Ala 


Val 


Leu 


50 








55 






60 




His 




Gin 


Leu 


Leu 


Leu 


Val 


Leu 


Leu 


Gly 


lie 


Gly 


Phe Leu Val 


Trp 


Leu 


65 










70 




75 








80 


Tyr 


Arg 


Asp 


Val 


Arg Val 


Gin 


Lys 


Val 


Phe Asn Gly Tyr 


Met 


Arg 


He 




85 










90 






95 


Glu 


Thr 


Asn 


Glu 


Asn 


Phe 


Val 


Asp 


Ala 


Tyr 


Glu Asn Ser 


Asn 


Ser 


Thr 








100 








105 






110 






Phe 


Val 


Ser 


Leu 


Ala 


Ser 


Lys 


Val 


Lys 


Asp Ala Leu 


Lys 


Leu 


Leu 


Tyr 






115 










120 






125 








Ser 


Gly 


Val 


Pro 


Phe 


Leu Gly 


Pro 


Tyr 


His Lys Glu 


Ser 


Ala 


Val 


Thr 




130 










135 






140 






Phe 




Ala 


Phe 


Ser 


Glu 


Gly 


Ser 


Val 


He 


Ala 


Tyr Tyr Trp 


Ser 


Glu 


Ser 


145 








150 








155 








160 


lie 


Pro 


Gin 


His 


Leu 


Val 


Glu 


Glu Ala Glu Arg Val 


Met 


Ala 


Glu 


Glu 










165 










170 






175 


Val 


Arg 


Val 


Val 


Met 


Leu 


Pro 


Pro 


Arg 


Ala 


Arg Ser Leu 


Lys 


Ser 


Phe 



2404 



age ggc ggc gtg gac tec tgc cag ggt gat tec ggg gga ccc ctg tec 2452 
Ser Gly Gly Val Asp Ser Cys Gin Gly Asp Ser Gly Gly Pro Leu Ser 
795 800 805 810 

age gtg gag gcg gat ggg egg ate ttc cag gee ggt gtg gtg age tgg 2500 
Ser Val Glu Ala Asp Gly Arg He Phe Gin Ala Gly Val Val Ser Trp 

815 820 825 

gga gac ggc tgc get cag agg aac aag cca ggc gtg tac aca agg etc 254 8 

Gly Asp Gly Cys Ala Gin Arg Asn Lys Pro Gly Val Tyr Thr Arg Leu 

830 835 840 

cct ctg ttt egg gac tgg ate aaa gag aac act ggg gta ta ggggccgggg 2599 
Pro Leu Phe Arg Asp Trp He Lys Glu Asn Thr Gly Val 
845 850 855 

ccacccaaat gtgtacacct gcggggccac ccatcgtcca ccccagtgtg cacgcctgca 2659 

ggctggagac tggaccgctg actgcaccag cgcccccaga acatacactg tgaactcaat 2719 

ctccagggct ccaaatctgc ctagaaaacc. tctcgcttcc tcagcctcca aagtggagct 2779 

gggaggtaga aggggaggac actggtggtt ctactgaccc aactgggggc aaaggtttga 283 9 

agacacagcc tcccccgcca gccccaagct gggecgagge gcgtttgtgt atatctgect 2 899 

cccctgtctg taaggagcag egggaaegga getteggage ctcctcagtg aaggtggtgg 2 959 

ggctgccgga tctgggctgt ggggcccttg ggccacgctc ttgaggaagc ccaggctcgg 3019 

aggaccctgg aaaacagacg ggtctgagac tgaaattgtt ttaccagctc ccagggtgga 3 079 

cttcagtgtg tgtatttgtg taaatgggta aaacaattta tttcttttta aaaaaaaaaa 3139 

aaaaaaaa 3147 

<210> 2 

<211> 855 

<212> PRT 

<213> Homo Sapien 
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180 

Val Thr Ser Val 
195 

Thr Gin Asp Asn 
210 

Leu Met Arg Phe 
225 

His Ala Arg Cys 

Ser Leu Thr Phe 

260 

Ser Asp Leu Val 
275 

Ala Leu Val Gin 
290 

Phe His Ser Ser 
305 

Glu Arg Arg His 

Met Ser Ser Cys 

340 

Ser Pro Tyr Tyr 
355 

Asn lie Glu Val 
370 

Phe Tyr Leu Leu 
385 

Tyr Val Glu He 

Val Val Thr Ser 

420 

Gin Ser Tyr Thr 
435 

Ser Ser Asp Pro 
450 

He Arg Lys Glu 
465 

Ser Asp Glu Leu 

Lys Asn Lys Phe 

500 

Asp Cys Gly Asp 
515 

Thr Phe Arg Cys 
530 

Asn Gly Lys Asp 
545 

Lys Val Asn Val 

Gly Leu Cys Leu 

580 

Cys Ser Asp Gly 
595 

Phe Thr Arg Gin 
610 

Glu Trp Pro Trp 
625 

Cys Gly Ala Ser 

Cys Tyr He Asp 

660 

Thr Ala Phe Leu 



Val Ala Phe Pro 

200 

Ser Cys Ser Phe 
215 

Thr Thr Pro Gly 
230 

Gin Trp Ala Leu 
245 

Arg Ser Phe Asp 

Thr Val Tyr Asn 

280 

Leu Cys Gly Thr 
295 

Gin Asn Val Leu 
310 

Pro Gly Phe Glu 
325 

Gly Gly Arg Leu 

Pro Gly His Tyr 

360 

Pro Asn Asn Gin 

375 

Glu Pro Gly Val 
390 

Asn Gly Glu Lys 
405 

Asn Ser Asn Lys 

Asp Thr Gly Phe 

440 

Cys Pro Gly Gin 
455 

Leu Arg Cys Asp 
470 

Asn Cys Ser Cys 
485 

Cys Lys Pro Leu 

Asn Ser Asp Glu 

520 

Ser Asn Gly Lys 
535 

Asp Cys Gly Asp 
550 

Val Thr Cys Thr 
565 

Ser Lys Gly Asn 

Ser Asp Glu Lys 

600 

Ala Arg Val Val 
615 

Gin Val Ser Leu 
630 

Leu He Ser Pro 
645 

Asp Arg Gly Phe 
Gly Leu His Asp 



185 

Thr Asp Ser Lys 

Gly Leu His Ala 

220 

Phe Pro Asp Ser 
235 

Arg Gly Asp Ala 
250 

Leu Ala Ser Cys 
265 

Thr Leu Ser Pro 

Tyr Pro Pro Ser 

300 

Leu He Thr Leu 
315 

Ala Thr Phe Phe 
330 

Arg Lys Ala Gin 
345 

Pro Pro Asn He 

His Val Lys Val 

380 

Pro Ala Gly Thr 
395 

Tyr Cys Gly Glu 
410 

He Thr Val Arg 
425 

Leu Ala Glu Tyr 

Phe Thr Cys Arg 

460 

Gly Trp Ala Asp 
475 

Asp Ala Gly His 
490 

Phe Trp Val Cys 
505 

Gin Gly Cys Ser 

Cys Leu Ser Lys 

540 

Gly Ser Asp Glu 
555 

Lys His Thr Tyr 
570 

Pro Glu Cys Asp 
585 

Asp Cys Asp Cys 

Gly Gly Thr Asp 

620 

His Ala Leu Gly 
635 

Asn Trp Leu Val 
650 

Arg Tyr Ser Asp 
665 

Gin Ser Gin Arg 



190 

Thr Val Gin Arg 
205 

Arg Gly Val Glu 

Pro Tyr Pro Ala 

240 

Asp Ser Val Leu 
255 

Asp Glu Arg Gly 
270 

Met Glu Pro His 
285 

Tyr Asn Leu Thr 

He Thr Asn Thr 

320 

Gin Leu Pro Arg 
335 

Gly Thr Phe Asn 
350 

Asp Cys Thr Trp 
365 

Ser Phe Lys Phe 

Cys Pro Lys Asp 

400 

Arg Ser Gin Phe 
415 

Phe His Ser Asp 
430 

Leu Ser Tyr Asp 
445 

Thr Gly Arg Cys 

Cys Thr Asp His 

480 

Gin Phe Thr Cys 
495 

Asp Ser Val Asn 
510 

Cys Pro Ala Gin 
525 

Ser Gin Gin Cys 

Ala Ser Cys Pro 

560 

Arg Cys Leu Asn 
575 

Gly Lys Glu Asp 
590 

Gly Leu Arg Ser 
605 

Ala Asp Glu Gly 

Gin Gly His He 

640 

Ser Ala Ala His 
655 

Pro Thr Gin Trp 
670 

Ser Ala Pro Gly 
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675 




Val 


Gin 


Glu 


Arg 




690 






Asp 


Phe 


Thr 


Phe 


705 








Ala 


Glu 


Tyr 


Ser 


His 


Val 


Phe 


Pro 








740 


Thr 


Gin 


Tyr 


Gly 






755 




Arg 


Val 


He 


Asn 




770 






Thr 


Pro 


Arg 


Met 


785 








Cys 


Gin 


Gly Asp 


Arg 


He 


Phe 


Gin 








820 


Arg 


Asn 


Lys 


Pro 






835 




lie 


Lys 


Glu 


Asn 




850 







680 

Arg Leu Lys Arg 
695 

Asp Tyr Asp He 
710 

Ser Met Val Arg 
725 

Ala Gly Lys Ala 

Gly Thr Gly Ala 

760 

Gin Thr Thr Cys 
775 

Met Cys Val Gly 
790 

Ser Gly Gly Pro 
805 

Ala Gly Val Val 

Gly Val Tyr Thr 

840 

Thr Gly Val 
855 



X1C 


T 1 - 
J. ic 




Til a 
XliS 








700 


iiia 


JjcU 


Leu 


Glu 






715 




Pro 


He 


Cys 


Leu 




730 






He 


Trp 


Val 


Thr 


745 








Leu 


He 


Leu 


Gin 


Glu 


Asn 


Leu 


Leu 








780 


Phe 


Leu 


Ser 


Gly 






795 




Leu 


Ser 


Ser 


Val 




810 






Ser 


Trp 


Gly Asp 


825 








Arg 


Leu 


Pro 


Leu 



685 

Pro Phe Phe Asn 

Leu Glu Lys Pro 

720 

Pro Asp Ala Ser 
735 

Gly Trp Gly His 
750 

Lys Gly Glu He 
765 

Pro Gin Gin He 

Gly Val Asp Ser 

800 

Glu Ala Asp Gly 
815 

Gly Cys Ala Gin 
830 

Phe Arg Asp Trp 
845 



<210> 3 

<211> 2137 

<212> DNA 

<213> Homo Sapien 



<220> 
<221> CDS 

<222> (261) . . . (1574) 

<223> DNA sequence encoding a transmembrane serine 
protease (MTSP3) protein 

<400> 3 

ccatcctaat acgactcact atagggctcg agcggccgcc cgggcaggtc agagagaggc 60 
agcagcttgc tcagcggaca aggatgctgg gcgtgaggga ccaaggcctg ccctgcactc 120 
gggcctcctc cagccagtgc tgaccaggga cttctgacct gctggccagc caggacctgt 180 
9tggggaggc cctcctgctg ccttggggtg acaatctcag ctccaggcta cagggagacc 240 
gggaggatca cagagccagc atg tta cag gat cct gac agt gat caa cct ctg 293 

Met Leu Gin Asp Pro Asp Ser Asp Gin Pro Leu 
15 10 

aac age etc gat gtc aaa ccc ctg cgc aaa ccc cgt ate ccc atg gag 341 
Asn Ser Leu Asp Val Lys Pro Leu Arg Lys Pro Arg He Pro Met Glu 

15 20 25 

ace ttc aga aag gtg ggg ate ccc ate ate ata gca eta ctg age ctg 389 
Thr Phe Arg Lys Val Gly He Pro He He He Ala Leu Leu Ser Leu 
30 35 40 

gcg agt ate ate att gtg gtt gtc etc ate aag gtg att ctg gat aaa 437 
Ala Ser He He He Val Val Val Leu He Lys Val He Leu Asp Lys 
45 50 55 

tac tac ttc etc tgc ggg cag cct etc cac ttc ate ccg agg aag cag 485 
Tyr Tyr Phe Leu Cys Gly Gin Pro Leu His Phe He Pro Arg Lys Gin 
60 65 70 75 



ctg tgt gac gga gag ctg gac tgt ccc ttg ggg gag gac gag gag cac 
Leu Cys Asp Gly Glu Leu Asp Cys Pro Leu Gly Glu Asp Glu Glu His 



533 
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80 85 90 

tgt gtc aag age ttc ccc gaa ggg cct gca gtg gca gtc cgc etc tec 581 

Cys Val Lys Ser Phe Pro Glu Gly Pro Ala Val Ala Val Arg Leu Ser 

95 100 105 

aag gac cga tec aca ctg cag gtg ctg gac teg gec aca ggg aac tgg 629 

Lys Asp Arg Ser Thr Leu Gin Val Leu Asp Ser Ala Thr Gly Asn Trp 
110 115 120 

ttc tct gec tgt ttc gac aac ttc aca gaa get etc get gag aca gee 677 

Phe Ser Ala Cys Phe Asp Asn Phe Thr Glu Ala Leu Ala Glu Thr Ala 
125 130 135 

tgt agg cag atg ggc tac age age aaa ccc ace ttc aga get gtg gag 725 

Cys Arg Gin Met Gly Tyr Ser Ser Lys Pro Thr Phe Arg Ala Val Glu 

140 145 150 155 

att ggc cca gac cag gat ctg gat gtt gtt gaa ate aca gaa aac age 773 

lie Gly Pro Asp Gin Asp Leu Asp Val Val Glu lie Thr Glu Asn Ser 

160 165 170 

cag gag ctt cgc atg egg aac tea agt ggg ccc tgt etc tea ggc tec 821 

Gin Glu Leu Arg Met Arg Asn Ser Ser Gly Pro Cys Leu Ser Gly Ser 

175 180 185 

ctg gtc tec ctg cac tgt ctt gee tgt ggg aag age ctg aag ace ccc 869 

Leu Val Ser Leu His Cys Leu Ala Cys Gly Lys Ser Leu Lys Thr Pro 
190 195 200 

cgt gtg gtg ggt ggg gag gag gee tct gtg gat tct tgg cct tgg cag 917 

Arg Val Val Gly Gly Glu Glu Ala Ser Val Asp Ser Trp Pro Trp Gin 
205 210 215 

gtc age ate cag tac gac ata cag cac gtc tgt gga ggg age ate ctg 965 

Val Ser lie Gin Tyr Asp lie Gin His Val Cys Gly Gly Ser lie Leu 

220 225 230 235 

gac ccc cac tgg gtc etc acg gca gee cac tgc ttc agg aaa cat ace 1013 

Asp Pro His Trp Val Leu Thr Ala Ala His Cys Phe Arg Lys His Thr 

240 245 250 

gat gtg ttc aac tgg aag gtg egg gca ggc tea gac aaa ctg ggc age 1061 

Asp Val Phe Asn Trp Lys Val Arg Ala Gly Ser Asp Lys Leu Gly Ser 

255 260 265 

ttc cca tec ctg get gtg gec aag ate ate ate att gaa ttc aac ccc 1109 

Phe Pro Ser Leu Ala Val Ala Lys lie lie lie lie Glu Phe Asn Pro 
270 275 280 

atg tac ccc aaa gac aat gac ate gec etc atg aag ctg cag ttc cca 1157 

Met Tyr Pro Lys Asp Asn Asp lie Ala Leu Met Lys Leu Gin Phe Pro 
285 290 295 

etc act ttc tea ggc aca gtc agg etc ate tgt ctg ccc ttc ttt gat 1205 

Leu Thr Phe Ser Gly Thr Val Arg Leu lie Cys Leu Pro Phe Phe Asp 

300 305 310 315 

gag gag etc act cca gec acc cca etc tgg ate att gga tgg ggc ttt 1253 

Glu Glu Leu Thr Pro Ala Thr Pro Leu Trp lie lie Gly Trp Gly Phe 

320 325 330 
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acg aag cag aat gga ggg aag atg tct gac ata ctg ctg cag gcg tea 1301 
Thr Lys Gin Asn Gly Gly Lys Met Ser Asp lie Leu Leu Gin Ala Ser 

335 340 345 



gtc cag gtc att gac age aca egg tgc aat gca gac gat gcg tac cag 1349 
Val Gin Val lie Asp Ser Thx Arg Cys Asn Ala Asp Asp Ala Tyr Gin 
350 355 360 

ggg gaa gtc acc gag aag atg atg tgt gca ggc ate ccg gaa ggg ggt 1397 
Gly Glu Val Thr Glu Lys Met Met Cys Ala Gly lie Pro Glu Gly Gly 
365 370 375 

gtg gac acc tgc cag ggt gac agt ggt ggg ccc ctg atg tac caa tct 1445 
Val Asp Thr Cys Gin Gly Asp Ser Gly Gly Pro Leu Met Tyr Gin Ser 
380 385 390 395 

gac cag tgg cat gtg gtg ggc ate gtt age tgg ggc tat ggc tgc ggg 1493 
Asp Gin Trp His Val Val Gly lie Val Ser Trp Gly Tyr Gly Cys Gly 

400 405 410 

ggc ccg age acc cca gga gta tac acc aag gtc tea gec tat etc aac 1541 
Gly Pro Ser Thr Pro Gly Val Tyr Thr Lys Val Ser Ala Tyr Leu Asn 

415 420 425 

tgg ate tac aat gtc tgg aag get gag ctg taa tgctgctgcc ectttgeagt 1594 
Trp lie Tyr Asn Val Trp Lys Ala Glu Leu * 
430 435 

getgggagee gcttccttcc tgccctgccc acctggggat cccccaaagt cagacacaga 1654 

gcaagagtcc ccttgggtac acccctctgc ccacagcctc agcatttctt ggagcagcaa 1714 

agggectcaa ttcctgtaag agaccctcgc ageccagagg cgcccagagg aagtcagcag 1774 

ccctagctcg gccacacttg gtgctcccag catcccaggg agagacacag cccactgaac 1834 

aaggtctcag gggtattget aagccaagaa ggaactttcc cacactactg aatggaagca 1894 

ggctgtcttg taaaagecca gatcactgtg ggctggagag gagaaggaaa gggtctgege 1954 

cagccctgtc cgtcttcacc catccccaag cctactagag caagaaacca gttgtaatat 2014 

aaaatgeact gccctactgt tggtatgact accgttacct actgttgtca t tgt tat tac 2074 

agetatggee actattatta aagagctgtg taacaaaaaa aaaaaaaaaa aaaaaaaaaa 2134 

aaa 2137 

<210> 4 

<211> 437 

<212> PRT 

<213> Homo Sapien 



<400> 4 



Met 


Leu 


Gin 


Asp 


Pro 


Asp 


Ser 


Asp 


Gin 


Pro 


Leu 


Asn 


Ser 


Leu 


Asp 


Val 


1 






5 








10 










15 




Lys 


Pro 


Leu 


Arg 


Lys 


Pro 


Arg 


lie 


Pro 


Met 


Glu 


Thr 


Phe 


Arg 


Lys 


Val 






20 










25 










30 






Gly 


lie 


Pro 


lie 


lie 


He 


Ala 


Leu 


Leu 


Ser 


Leu 


Ala 


Ser 


He 


He 


He 




35 










40 










45 








Val 


Val 
50 


Val 


Leu 


lie 


Lys 


Val 
55 


lie 


Leu 


Asp 


Lys 


Tyr 
60 


Tyr 


Phe 


Leu 


Cys 


Gly Gin 


Pro 


Leu 


His 


Phe 


He 


Pro 


Arg 


Lys 


Gin 


Leu 


Cys 


Asp 


Gly 


Glu 


65 










70 










75 










80 


Leu 


Asp 


Cys 


Pro 


Leu 


Gly 


Glu 


Asp 


Glu 


Glu 


His 


Cys 


Val 


Lys 


Ser 


Phe 








85 








90 










95 




Pro 


Glu 


Gly 


Pro 


Ala 


Val 


Ala 


Val 


Arg 


Leu 


Ser 


Lys 


Asp 


Arg 


Ser 


Thr 






100 










105 










110 






Leu 


Gin 


Val 
115 


Leu 


Asp 


Ser 


Ala 


Thr 
120 


Gly 


Asn 


Trp 


Phe 


Ser 
125 


Ala 


Cys 


Phe 


Asp 


Asn 


Phe 


Thr 


Glu 


Ala 


Leu 


Ala 


Glu 


Thr 


Ala 


Cys 


Arg 


Gin 


Met 


Gly 



WO 01/57194 



10/66 



PCT/US01/03471 





130 










135 










140 










Tyr 


Ser 


Ser 


Lys 


Pro 


Thr 


Phe 


Arg 


Ala 


Val 


Glu 


He 


Gly 


Pro 


Asp 


Gin 


145 








150 










155 










160 


Asp 


Leu 


Asp 


Val 


Val 


Glu 


He 


Thr 


Glu 


Asn 


Ser 


Gin 


Glu 


Leu 


Arg 


Met 






165 










170 










175 




Arg 


Asn 


Ser 


Ser 


Gly 


Pro 


Cys 


Leu 


Ser 


Gly 


Ser 


Leu 


Val 


Ser 


Leu 


His 






180 










185 










190 






Cys 


Leu 


Ala 


Cys 


Gly 


Lys 


Ser 


Leu 


Lys 


Thr 


Pro 


Arg 


Val 


Val 


Gly Gly 






195 










200 










205 








Glu 


Glu 
210 


Ala 


Ser 


Val 


Asp 


Ser 
215 


Trp 


Pro 


Trp 


Gin 


Val 
220 


Ser 


He 


Gin 


Tyr 


Asp 


lie 


Gin 


His 


Val 


Cys 


Gly 


Gly 


Ser 


He 


Leu 


Asp 


Pro 


His 


Trp 


Val 


225 










230 










235 










240 


Leu 


Thr 


Ala 


Ala 


His 
245 


Cys 


Phe 


Arg 


Lys 


His 
250 


Thr 


Asp 


Val 


Phe 


Asn 
255 


Trp 


Lys 


Val 


Arg 


Ala 


Gly 


Ser 


Asp 


Lys 


Leu Gly 


Ser 


Phe 


Pro 


Ser 


Leu 


Ala 






260 










265 










270 






Val 


Ala 


Lys 
275 


He 


He 


He 


He 


Glu 
280 


Phe 


Asn 


Pro 


Met 


Tyr 
285 


Pro 


Lys 


Asp 


Asn 


Asp 
290 


lie 


Ala 


Leu 


Met 


Lys 
295 


Leu 


Gin 


Phe 


Pro 


Leu 
300 


Thr 


Phe 


Ser 


Gly 


Thr 


Val 


Arg 


Leu 


He 


Cys 


Leu 


Pro 


Phe 


Phe 


Asp 


Glu 


Glu 


Leu 


Thr 


Pro 


305 








310 










315 










320 


Ala 


Thr 


Pro 


Leu 


Trp 
325 


He 


He 


Gly 


Trp 


Gly 
330 


Phe 


Thr 


Lys 


Gin 


Asn 
335 


Gly 


Gly 


Lys 


Met 


Ser 


Asp 


He 


Leu 


Leu 


Gin 


Ala 


Ser 


Val 


Gin 


Val 


He 


Asp 




340 










345 










350 






Ser 


Thr 


Arg 


Cys 


Asn 


Ala 


Asp 


Asp 


Ala 


Tyr 


Gin 


Gly Glu 


Val 


Thr 


Glu 






355 








360 










365 








Lys 


Met 


Met 


Cys 


Ala 


Gly 


He 


Pro 


Glu 


Gly 


Gly Val 


Asp 


Thr 


Cys 


Gin 


370 








375 










380 










Gly 


Asp 


Ser 


Gly Gly 


Pro 


Leu 


Met 


Tyr 


Gin 


Ser 


Asp 


Gin 


Trp 


His 


Val 


365 








390 










395 










400 


Val 


Gly 


lie 


Val 


Ser 


Trp 


Gly 


Tyr 


Gly 


Cys 


Gly 


Gly 


Pro 


Ser 


Thr 


Pro 








405 










410 






• 




415 




Gly 


Val 


Tyr 


Thr 


Lys 


Val 


Ser 


Ala 


Tyr 


Leu 


Asn 


Trp 


He 


Tyr 


Asn 


Val 




420 










425 










430 






Trp 


Lys 


Ala 
435 


Glu 


Leu 

























<210> 5 

<211> 708 

<212> DNA 

<213> Homo Sapien 

<220> 

<221> CDS 

<222> (1) . . . (708) 

<223> MTSP4 protease domain cDNA 
<400> 5 

att gtt ggt gga get gtg tec tec gag ggt gag tgg cca tgg cag gec 
He Val Gly Gly Ala Val Ser Ser Glu Gly Glu Trp Pro Trp Gin Ala 
15 10 15 



48 



age etc cag gtt egg ggt cga cac ate tgt ggg ggg gee etc ate get 96 

Ser Leu Gin Val Arg Gly Arg His He Cys Gly Gly Ala Leu He Ala 

20 25 30 

gac cgc tgg gtg ata aca get gee cac tgc ttc cag gag gac age atg 144 

Asp Arg Trp Val He Thr Ala Ala His Cys Phe Gin Glu Asp Ser Met 

35 40 45 
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gcc tec acg gtg ctg tgg acc gtg ttc ctg ggc aag gtg tgg cag aac 192 

Ala Ser Thr Val Leu Trp Thr Val Phe Leu Gly Lys Val Trp Gin Asn 
50 55 60 

teg cgc tgg cct gga gag gtg tec ttc aag gtg age cgc ctg etc ctg 24 0 

Ser Arg Trp Pro Gly Glu Val Ser Phe Lys Val Ser Arg Leu Leu Leu 
65 70 75 80 

cac ccg tac cac gaa gag gac age cat gac tac gac gtg gcg ctg ctg 2 88 

His Pro Tyr His Glu Glu Asp Ser His Asp Tyr Asp Val Ala Leu Leu 

85 90 95 

cag etc gac cac ccg gtg gtg cgc teg gcc gcc gtg cgc ccc gtc tgc 336 

Gin Leu Asp His Pro Val Val Arg Ser Ala Ala Val Arg Pro Val Cys 

100 105 110 

ctg ccc gcg cgc tec cac ttc ttc gag ccc ggc ctg cac tgc tgg att 384 

Leu Pro Ala Arg Ser His Phe Phe Glu Pro Gly Leu His Cys Trp lie 
115 120 125 

acg ggc tgg ggc gcc ttg cgc gag ggc ggc ccc ate age aac get ctg 432 
Thr Gly Trp Gly Ala Leu Arg Glu Gly Gly Pro lie Ser Asn Ala Leu 
130 135 140 

cag aaa gtg gat gtg cag ttg ate cca cag gac ctg tgc age gag gtc . 480 

Gin Lys Val Asp Val Gin Leu lie Pro Gin .Asp Leu Cys Ser Glu Val 
145 150 155 160 



tat cgc tac cag gtg acg cca cgc atg ctg tgt gcc ggc tac cgc aag 
Tyr Arg Tyr Gin Val Thr Pro Arg Met Leu Cys Ala Gly Tyr Arg Lys 

165 170 175 



aag.gca etc agt ggc cgc tgg ttc ctg gcg ggg ctg gtc age tgg ggc 
Lys Ala Leu Ser Gly Arg Trp Phe Leu Ala Gly Leu Val Ser Trp Gly 
195 200 .205 

ctg ggc tgt ggc egg cct aac tac ttc ggc gtc tac acc cgc ate aca 
Leu Gly Cys Gly Arg Pro Asn Tyr Phe Gly Val Tyr Thr Arg lie Thr 
210 215 220 

ggt gtg ate age tgg ate cag caa gtg gtg acc tga 
Gly Val lie Ser Trp lie Gin Gin Val Val Thr * 
225 230 235 



<210> 6 

<211> 235 

<212> PRT 

<213> Homo Sapien 

<400> 6 

lie Val Gly Gly Ala Val Ser Ser Glu Gly Glu Trp Pro Trp Gin Ala 

15 10 15 

Ser Leu Gin Val Arg Gly Arg His He Cys Gly Gly Ala Leu He Ala 

20 25 30 

Asp Arg Trp Val He Thr Ala Ala His Cys Phe Gin Glu Asp Ser Met 
35 40 45 



528 



ggc aag aag gat gcc tgt cag ggt gac tea ggt ggt ccg ctg gtg tgc 576 
>- Gly Lys Lys Asp Ala Cys Gin Gly Asp Ser Gly Gly Pro Leu Val Cys 

180 185 190 



624 



672 



708 
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Ala 


Ser 


Thr 


Val 




50 






Rot* 

OCX 




■X x lj 


XT X. U 


c. c 
b t> 








rlxS 


fro 


xyr 


UXS 


I3XI1 


T - «***^ ^ 

1j6 LL 




111 s 








100 


Ijcu 


FjTO 




Arg 






115 




Thr 


Gly 


Trp 


Gly 




130 






Gin 


Lys 


Val 


Asp 


145 








Tyr 


Arg 


Tyr 


Gin 


Gly 


Lys 


Lys 


Asp 








180 


Lys 


Ala 


Leu 


Ser 






195 




Leu 


Gly 


Cys 


Gly 




210 






Gly 


Val 


lie 


Ser 


225 









Leu Trp Thr Val 
55 

Gly Glu Val Ser 
70 

Glu Glu Asp Ser 
85 

Pro Val Val Arg 

Ser His Phe Phe 

120 

Ala Leu Arg Glu 

135 

Val Gin Leu He 
150 

Val Thr Pro Arg 
165 

Ala Cys Gin Gly 

Gly Arg Trp Phe 

200 

Arg Pro Asn Tyr 
215 

Trp He Gin Gin 
230 





JJCU 












D U 




T ,vq 


Val 

V C* X 


Cpr 

kj ^ 










His 


Asp 


Tyr 


Asp 




90 






Gar 
OCX 


Axel 


nJ.ol 


V CLX 


1 AC 








blU 


Pro 


taXy 


Leu 


Gly 


Gly 


Pro 


He 








140 


Pro 


Gin 


Asp 


Leu 






155 




Met 


Leu 


Cys 


Ala 




170 






Asp 


Ser 


Gly 


Gly 


185 








Leu 


Ala 


Gly 


Leu 


Phe 


Gly 


Val 


Tyr 








220 


Val 


Val 


Thr 








235 





V CI X 


Trrj 


Gin 


Ran 


Arg 






Leu 








Q O 

oU 


V d J- 


i-vXcl 


XJC IX 


XJt^U. 












Pt*o 
X v 


Val 

V CLX 






110 






His 


Cys 


Trp 


He 


125 








Ser 


Asn 


Ala 


Leu 


Cys 


Ser 


Glu 


Val 






160 


Gly 


Tyr 


Arg 


Lys 






175 




Pro 


Leu 


Val 


Cys 




190 




Val 


Ser 


Trp 


Gly 


205 








Thr 


Arg 


lie 


Thr 



<210> 7 

<211> 3104 

<212> DNA 

<213> Homo Sapien 



<220> 
<221> CDS 

<222> (33) . . . (2441) 

<223> cDNA encoding :MTSP4-L (long form) splice variant 
<400> 7 

tcatcggcca gagggtgatc agtgagcaga ag atg ccc gtg gec gag gec ccc 53 

Met Pro Val Ala Glu Ala Pro 

1 5 

cag gtg get ggc ggg cag ggg gac gga ggt gat ggc gag gaa gcg gag 101 

Gin Val Ala Gly Gly Gin Gly Asp Gly Gly Asp Gly Glu Glu Ala Glu 

10 15 20 

ccg gag ggg atg ttc aag gec tgt gag gac tec aag aga aaa gec egg 14 9 

Pro Glu Gly Met Phe Lys Ala Cys Glu Asp Ser Lys Arg Lys Ala Arg 

25 30 35 

ggc tac etc cgc ctg gtg ccc ctg ttt gtg ctg ctg gee ctg etc gtg 197 

Gly Tyr Leu Arg Leu Val Pro Leu Phe Val Leu Leu Ala Leu Leu Val 

40 45 50 55 

ctg get teg gcg ggg gtg eta etc tgg tat ttc eta ggg tac aag gcg 245 

Leu Ala Ser Ala Gly Val Leu Leu Trp Tyr Phe Leu Gly Tyr Lys Ala 

60 65 70 

gag gtg atg gtc age cag gtg tac tea ggc agt ctg cgt gta etc aat 293 

Glu Val Met Val Ser Gin Val Tyr Ser Gly Ser Leu Arg Val Leu Asn 

75 80 85 

cgc cac ttc tec cag gat ctt acc cgc egg gaa tct agt gee ttc cgc 341 

Arg His Phe Ser Gin Asp Leu Thr Arg Arg Glu Ser Ser Ala Phe Arg 
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90 95 100 

agt gaa acc gcc aaa gcc cag aag atg etc aag gag etc ate ace age 
Ser Glu Thr Ala Lys Ala Gin Lys Met Leu Lys Glu Leu lie Thar Ser 
105 110 ' 115 



gag gga ccc etc acc tgc ttc ttc tgg ttc att etc caa ate ccc gag 
Glu Gly Pro Leu Thr Cys Phe Phe Trp Phe He Leu Gin He Pro Glu 

140 145 150 



gtg aaa gac ata get gca ttg aat tec acg ctg ggt tgt tac cgc tac 
Val Lys Asp He Ala Ala Leu Asn Ser Thr Leu Gly Cys Tyr Arg Tyr 
200 205 210 215 



ctg gcc tec age tgc ctg tgg cac ctg cag ggc ccc aag gac etc atg 
Leu Ala Ser Ser Cys Leu Trp His Leu Gin Gly Pro Lys Asp Leu Met 

235 240 245 



gcc atg tat gac gtg gcc ggg ccc ctg gag aag agg etc ate acc teg 

Ala Met Tyr Asp Val Ala Gly Pro Leu Glu Lys Arg Leu He Thr Ser 
265 270 275 

gtg tac ggc tgc age cgc cag gag ccc gtg gtg gag gtt ctg gcg teg 

Val Tyr Gly Cys Ser Arg Gin Glu Pro Val Val Glu Val Leu Ala Ser 

280 285 290 295 



389 



acc cgc ctg gga act tac tac aac tec age tec gtc tat tec ttt ggg 437 
Thr Arg Leu Gly Thr Tyr Tyr Asn Ser Ser Ser Val Tyr Ser Phe Gly 
120 125 130 135 



485 



cac cgc egg ctg atg ctg age ccc gag gtg gtg cag gca ctg ctg gtg 533 
His Arg Arg Leu Met Leu Ser Pro Glu Val Val Gin Ala Leu Leu Val 

155 160 165 

gag gag ctg ctg tec aca gtc aac age teg get gcc gtc ccc tac agg 581 
Glu Glu Leu Leu Ser Thr Val Asn Ser Ser Ala Ala Val Pro Tyr Arg 
170 175 180 

gcc gag tac gaa gtg gac ccc gag ggc eta gtg ate ctg gaa gcc agt 629 
Ala Glu Tyr Glu Val Asp Pro Glu Gly Leu Val He Leu Glu Ala Ser 
185 190 195 



677 



age tac gtg ggc cag ggc cag gtc etc egg ctg aag ggg cct gac cac 725 
Ser Tyr Val Gly Gin Gly Gin Val Leu Arg Leu Lys Gly Pro Asp His 

220 225 230 



773 



etc aaa etc egg ctg gag tgg acg ctg gca gag tgc egg gac cga ctg 821 
Leu Lys Leu Arg Leu Glu Trp Thr Leu Ala Glu Cys Arg Asp Arg Leu 
250 255 260 



869 



917 



ggg gcc ate atg gcg gtc gtc tgg aag aag ggc ctg cac age tac tac 965 
Gly Ala He Met Ala Val Val Trp Lys Lys Gly Leu His Ser Tyr Tyr 

300 305 310 

gac ccc ttc gtg etc tec gtg cag ccg gtg gtc ttc cag gcc tgt gaa 1013 
Asp Pro Phe Val Leu Ser Val Gin Pro Val Val Phe Gin Ala Cys Glu 

315 320 325 

gtg aac ctg acg ctg gac aac agg etc gac tec cag ggc gtc etc age 1061 
Val Asn Leu Thr Leu Asp Asn Arg Leu Asp Ser Gin Gly Val Leu Ser 
330 335 340 
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acc ccg tac ttc ccc age tac tac teg ccc caa acc cac tgc tec tgg 1109 
Thr Pro Tyr Phe Pro Ser Tyr Tyr Ser Pro Gin Thr His Cys Ser Trp 
345 350 355 

cac etc acg gtg ccc tct ctg gac tac ggc ttg gee etc tgg ttt gat 1157 
His Leu Thr Val Pro Ser Leu Asp Tyr Gly Leu Ala Leu Trp Phe Asp 
360 365 370 375 

gee tat gca ctg agg agg cag aag tat gat ttg ccg tgc acc cag ggc 1205 
Ala Tyr Ala Leu Arg Arg Gin Lys Tyr Asp Leu Pro Cys Thr Gin Gly 

380 385 390 

cag tgg acg ate cag aac agg agg ctg tgt ggc ttg cgc ate ctg cag 1253 
Gin Trp Thr lie Gin Asn Arg Arg Leu Cys Gly Leu Arg lie Leu Gin 

395 400 405 

ccc tac gec gag agg ate ccc gtg gtg gee acg gec ggg ate acc ate 1301 
Pro Tyr Ala Glu Arg lie Pro Val Val Ala Thr Ala Gly He Thr He 
410 415 420 

aac ttc acc tec cag ate tec etc acc ggg ccc ggt gtg egg gtg cac 1349 
Asn Phe Thr Ser Gin He Ser Leu Thr Gly Pro Gly Val Arg Val His 
425 430 435 

tat ggc ttg tac aac cag teg gac ccc tgc cct gga gag ttc etc tgt 13 97 

Tyr Gly Leu Tyr Asn Gin Ser Asp Pro Cys Pro Gly Glu Phe Leu Cys 
440 445 450 455 

tct gtg aat gga etc tgt gtc cct gee tgt gat ggg gtc aag gac tgc 1445 
Ser Val Asn Gly Leu Cys Val Pro Ala Cys Asp Gly Val Lys Asp Cys 

460 465 470 

ccc aac ggc ctg gat gag aga aac tgc gtt tgc aga gee aca ttc cag 1493 
Pro Asn Gly Leu Asp Glu Arg Asn Cys Val Cys Arg Ala Thr Phe Gin 

475 480 485 

tgc aaa gag gac age aca tgc ate tea ctg ccc aag gtc tgt gat ggg 1541 
Cys Lys Glu Asp Ser Thr Cys He Ser Leu Pro Lys Val Cys Asp Gly 
490 495 500 

cag cct gat tgt etc aac ggc age gac gaa gag cag tgc cag gaa ggg 1589 
Gin Pro Asp Cys Leu Asn Gly Ser Asp Glu Glu Gin Cys Gin Glu Gly 
505 510 515 

gtg cca tgt ggg aca ttc acc ttc cag tgt gag gac egg age tgc gtg 1637 
Val Pro Cys Gly Thr Phe Thr Phe Gin Cys Glu Asp Arg Ser Cys Val 
520 525 530 535 

aag aag ccc aac ccg cag tgt gat ggg egg ccc gac tgc agg gac ggc 1685 
Lys Lys Pro Asn Pro Gin Cys Asp Gly Arg Pro Asp Cys Arg Asp Gly 

540 545 550 



teg gat gag gag cac tgt gaa tgt ggc etc cag ggc ccc tec age cgc 
Ser Asp Glu Glu His Cys Glu Cys Gly Leu Gin Gly Pro Ser Ser Arg 

555 560 565 



1733 



att gtt ggt gga get gtg tec tec gag ggt gag tgg cca tgg cag gec 1781 
He Val Gly Gly Ala Val Ser Ser Glu Gly Glu Trp Pro Trp Gin Ala 
570 575 580 

age etc cag gtt egg ggt cga cac ate tgt ggg ggg gec etc ate get 182 9 

Ser Leu Gin Val Arg Gly Arg His He Cys Gly Gly Ala Leu He Ala 
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585 590 595 
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He 


cca 
Pro 


cag 
Gin 


gac 
Asp 


ctg 
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Cys 


age 
Ser 

725 


gag 
Glu 


gtc 
Val 


2213 


tat cgc 
Tyr Arg 


tac 
Tyr 
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gtg 
Val 
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cca 
Pro 
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Arg 
735 


atg 
Met 
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Ala 
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Gly 
740 
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Arg 


aag 
Lys 
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Gly 
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Lys 
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Asp 
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Ala 
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Gin 
750 
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Gly 
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Asp 


tea 
Ser 
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Gly 
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Gly 
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ctg 
Leu 
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Val 
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Cys 
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Lys 
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Ala 


etc 
Leu 


agt 
Ser 
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Gly Arg 
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tgg 
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ttc 
Phe 


ctg 
Leu 
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Ala 


ggg 

Gly 
770 


ctg 
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gtc 

Val 


age 
Ser 


tgg 

Trp 


ggc 

Gly 
775 


2357 


ctg ggc 
Leu Gly 


tgt 
Cys 


ggc 

Gly 


egg 
Arg 
780 


cct 
Pro 


aac 
Asn 


tac 
Tyr 


ttc 
Phe 


ggc 
Gly 
785 


gtc 

Val 


tac 
Tyr 


ace 
Thr 


cgc 
Arg 


ate 
He 
790 


aca 
Thr 


2405 


ggt gtg 
Gly Val 


ate 
He 


age 
Ser 
795 


tgg 
Trp 


ate 
He 


cag 
Gin 


caa 
Gin 


gtg 

Val 
800 


gtg 

Val 


ace 
Thr 


tga 


ggaactgccc 




2451 



ccctgcaaag cagggcccac ctcctggact cagagagccc agggcaactg ccaagcaggg 2511 

ggacaagtat tctggcgggg ggtgggggag agagcaggee ctgtggtggc aggaggggca 2571 

tcttgtttcg tccctgatgt ctgtccagta tggcaggagg atgagaagtg ccagcagttg 2631 

ggggtcaaga cgtcccttga ggacccaggc ccacacccag cccttttgcc tcccaattct 2691 

ctctcctccg tccccttcct ccactgctgc etaatgeaag gcagtggctc agcagcaaga 2751 

atgctggttc tacatcccga ggagtgtctg aggtgcgccc cactctgtac agaggctgtt 2811 

tgggcagect tgcctccaga gagcagattc cagcttegga agcccctggt ctaacttggg 2871 

atctgggaat ggaaggtgct cccatcggag gggaccctca gagccctgga gaetgecagg 2931 
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tgggcctgct gccactgtaa gccaaaaggt ggggaagtcc tgactccagg gtccttgccc 2991 
cacccctgcc tgccacctgg gccctcacag cccagaccct cactgggagg tgagctcagc 3051 
tgccctttgg aataaagctg cctgatgcaa aaaaaaaaaa aaaaaaaaaa aaa 3104 

<210> 8 

<211> 802 

<212> PRT 

<213> Homo Sapien 

<400> 8 

Met Pro Val Ala Glu Ala Pro Gin Val Ala Gly Gly Gin Gly Asp Gly 

15 10 15 

Gly Asp Gly Glu Glu Ala Glu Pro Glu Gly Met Phe Lys Ala Cys Glu 

20 25 30' 

Asp Ser Lys Arg Lys Ala Arg Gly Tyr Leu Arg Leu Val Pro Leu Phe 

35 40 45 

Val Leu Leu Ala Leu Leu Val Leu Ala Ser Ala Gly Val Leu Leu Trp 

50 55 60 

Tyr Phe Leu Gly Tyr Lys Ala Glu Val Met Val Ser Gin Val Tyr Ser 
65 70 75 80 

Gly Ser Leu Arg Val Leu Asn Arg His Phe Ser Gin Asp Leu Thr Arg 

85 90 95 

Arg Glu Ser Ser Ala Phe Arg Ser Glu Thr Ala Lys Ala Gin Lys Met 

100 105 . 110 

Leu Lys Glu Leu lie Thr Ser Thr Arg Leu Gly Thr Tyr Tyr Asn Ser 

115 120 125 

Ser Ser Val Tyr Ser Phe Gly Glu Gly Pro Leu Thr Cys Phe Phe Trp 

130 135 140 

Phe lie Leu Gin lie Pro Glu His Arg Arg Leu Met Leu Ser Pro Glu 
145 150 155 160 

Val Val Gin Ala Leu Leu Val Glu Glu Leu Leu Ser Thr Val Asn Ser 

165 170 175 

Ser Ala Ala Val Pro Tyr Arg Ala Glu Tyr Glu Val Asp Pro Glu Gly 

180 185 190 

Leu Val lie Leu Glu Ala Ser Val Lys Asp lie Ala .Ala Leu Asn Ser 

195 200 205 

Thr Leu Gly Cys Tyr Arg Tyr Ser Tyr Val Gly Gin Gly Gin Val Leu 

210 215 220 

Arg Leu Lys Gly Pro Asp His Leu Ala Ser Ser Cys Leu Trp His Leu 
225 230 235 240 

Gin Gly Pro Lys Asp Leu Met Leu Lys Leu Arg Leu Glu Trp Thr Leu 

245 250 255 

Ala Glu Cys Arg Asp Arg Leu Ala Met Tyr Asp Val Ala Gly Pro Leu 

260 265 270 

Glu Lys Arg Leu lie Thr Ser Val Tyr Gly Cys Ser Arg Gin Glu Pro 

275 280 285 

Val Val Glu Val Leu Ala Ser Gly Ala He Met Ala Val Val Trp Lys 

290 295 300 

Lys Gly Leu His Ser Tyr Tyr Asp Pro Phe Val Leu Ser Val Gin Pro 
305 310 315 320 

Val Val Phe Gin Ala Cys Glu Val Asn Leu Thr Leu Asp Asn Arg Leu 

325 330 335 

Asp Ser Gin Gly Val Leu Ser Thr Pro Tyr Phe Pro Ser Tyr Tyr Ser 

340 345 350 

Pro Gin Thr Hie Cys Ser Trp His Leu Thr Val Pro Ser Leu Asp Tyr 

355 360 365 

Gly Leu Ala Leu Trp Phe Asp Ala Tyr Ala Leu Arg Arg Gin Lys Tyr 

370 375 380 

Asp Leu Pro Cys Thr Gin Gly Gin Trp Thr He Gin Asn Arg Arg Leu 
385 390 395 400 

Cys Gly Leu Arg He Leu Gin Pro Tyr Ala Glu Arg He Pro Val Val 

405 410 415 
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T - - 
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Val 
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Gly 
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His 
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Ser 
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Ala 


Gly 








740 


Ser 


Gly 


Gly 


Pro 






755 




Ala 


Gly 


Leu 


Val 




770 






Gly 


Val 


Tyr 


Thr 


785 








Val 


Thr 







lie Thr lie Asn 

Arg Val His Tyr 

440 

Phe Leu Cys Ser 
455 

Lys Asp Cys Pro 
470 

Thr Phe Gin Cys 
485 

Cys Asp Gly Gin 

Gin Glu Gly Val 

520 

Ser Cys Val Lys 
535 

Arg Asp Gly Ser 
550 

Ser Ser Arg lie 
565 

Trp Gin Ala Ser 

Leu lie Ala Asp 

600 

Asp Ser Met Ala 
615 

Trp Gin Asn Ser 
630 

Leu Leu Leu His 
645 

Ala Leu Leu Gin 

Pro Val Cys Leu 

680 

Cys Trp lie Thr 
695 

Asn Ala Leu Gin 
710 

Ser Glu Val Tyr 
725 

Tyr Arg Lys Gly 

Leu Val Cys Lys 

760 

Ser Trp Gly Leu 
775 

Arg He Thr Gly 
790 



rUc 
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Ser 
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/IOC 
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Asn 


Val 


Asn Gly Leu 
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Gly Leu Asp 
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Asp 
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r n c 

505 
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Cys 
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Pro 








54 0 
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Glu 
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Val 


Gly Gly Ala 
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Val 


Arg 
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He 
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Val 
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Arg 


Trp 


Pro 


Gly 






635 




Pro 


Tyr 


His 


Glu 




650 








Asp 


His 


Pro 


*~ 
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Pro 


Ala 


Arg 


Ser 


Gly 


Trp 


Gly Ala 








700 


Lys 


Val 


Asp 


Val 






715 




Arg 


Tyr 


Gin 


Val 




730 






Lys 


Lys 


Asp 


Ala 


745 








Ala 


Leu 


Ser 


Gly 


Gly 


Cys 


Gly Arg 








780 


Val 


He 


Ser 


Trp 






795 





He Ser Leu Thr 
430 

Gin Ser Asp Pro 
445 

Cys Val Pro Ala 

Glu Arg Asn Cys 

480 

Thr Cys He Ser 
495 

Asn Gly Ser Asp 
510 

Phe Thr Phe Gin 
525 

Gin Cys Asp Gly 

Cys Glu Cys Gly 

560 

Val Ser Ser Glu 
575 

Gly Arg His He 
590 

Thr Ala Ala His 
605 

Trp Thr Val Phe 

Glu Val Ser Phe 

640 

Glu Asp Ser His 
655 

Val Val Arg Ser 
670 

His Phe Phe Glu 
685 

Leu Arg Glu Gly 

Gin Leu He Pro 

720 

Thr Pro Arg Met 
735 

Cys Gin Gly Asp 
750 

Arg Trp Phe Leu 
765 

Pro Asn Tyr Phe 

He Gin Gin Val 

800 



<210> 9 

<211> 2672 

<212> DNA 

<213> Homo Sapien 

<220> 
<221> CDS 

<222> (33) . . . (2009) 

<223> cDNA encoding: MTSP4-S (short form) splice variant 



<400> 9 
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tcatcggcca gagggtgatc agtgagcaga ag atg ccc gtg gcc gag gcc ccc 53 

Met Pro Val Ala Glu Ala Pro 
1 5 

cag gtg get ggc ggg cag ggg gac gga ggt gat ggc gag gaa gcg gag 101 
Gin Val Ala Gly Gly Gin Gly Asp Gly Gly Asp Gly Glu Glu Ala Glu 
10 15 20 

ccg gag ggg atg ttc aag gcc tgt gag gac tec aag aga aaa gcc egg 149 
Pro- Glu Gly Met Phe Lys Ala Cys Glu Asp Ser Lys Arg Lys Ala Arg 
25 30 35 

ggc tac etc cgc ctg gtg ccc ctg ttt gtg ctg ctg gcc ctg etc gtg 197 
Gly Tyr Leu Arg Leu Val Pro Leu Phe Val Leu Leu Ala Leu Leu Val 
40 45 50 55 

ctg get teg gcg ggg gtg eta etc tgg tat ttc eta ggg tac aag gcg 245 
Leu Ala Ser Ala Gly Val Leu Leu Trp Tyr Phe Leu Gly Tyr Lys Ala 

60 65 70 

gag gtg atg gtc age cag gtg tac tea ggc agt ctg cgt gta etc aat 293 
Glu Val Met Val Ser Gin Val Tyr Ser Gly Ser Leu Arg Val Leu Asn 

75 80 85 

cgc cac ttc tec cag gat ctt ace cgc egg gaa tct agt gcc ttc cgc ■ 341 

Arg His Phe Ser Gin Asp Leu Thr Arg Arg Glu Ser Ser Ala Phe Arg 
90 95 100 

agt gaa ace gcc aaa gcc cag aag atg etc aag gag etc ate acc age 3 89 

Ser Glu Thr Ala Lys Ala Gin Lys Met Leu Lys Glu Leu lie Thr Ser 
105 110 115 

acc cgc ctg gga act tac tac aac tec age tec gtc tat tec ttt ggg 437 
Thr Arg Leu Gly Thr Tyr Tyr Asn Ser Ser Ser Val Tyr Ser Phe Gly 
120 125 130 135 

gtg tac ggc tgc age cgc cag gag ccc gtg gtg gag gtt ctg gcg teg 4 85 

Val Tyr Gly Cys Ser Arg Gin Glu Pro Val Val Glu Val Leu Ala Ser 

140 145 150 

ggg gcc ate atg gcg gtc gtc tgg aag aag ggc ctg cac age tac tac 533 
Gly Ala He Met Ala Val Val Trp Lys Lys Gly Leu His Ser Tyr Tyr 

155 160 165 

gac ccc ttc gtg etc tec gtg cag ccg gtg gtc ttc cag gcc tgt gaa 581 
Asp Pro Phe Val Leu Ser Val Gin Pro Val Val Phe Gin Ala Cys Glu 
170 175 180 

gtg aac ctg acg ctg gac aac agg etc gac tec cag ggc gtc etc age 629 
Val Asn Leu Thr Leu Asp Asn Arg Leu Asp Ser Gin Gly Val Leu Ser 
185 190 195 

acc ccg tac ttc ccc age tac tac teg ccc caa acc cac tgc tec tgg 677 
Thr Pro Tyr Phe Pro Ser Tyr Tyr Ser Pro Gin Thr His Cys Ser Trp 
200 205 210 215 

cac etc acg gtg ccc tct ctg gac tac ggc ttg gcc etc tgg ttt gat 725 
His Leu Thr Val Pro Ser Leu Asp Tyr Gly Leu Ala Leu Trp Phe Asp 

220 225 230 

gcc tat gca ctg agg agg cag aag tat gat ttg ccg tgc acc cag ggc 773 
Ala Tyr Ala Leu Arg Arg Gin Lys Tyr Asp Leu Pro Cys Thr Gin Gly 
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235 240 245 

cag tgg acg ate cag aac agg agg ctg tgt ggc ttg cgc ate ctg cag 821 

Gin Trp Thr lie Gin Asn Arg Arg Leu Cys Gly Leu Arg lie Leu Gin 

250 255 260 

ccc tac gec gag agg ate ccc gtg gtg gec acg gec ggg ate ace ate 869 

Pro Tyr Ala Glu Arg lie Pro Val Val Ala Thr Ala Gly lie Thr lie 

265 270 275 

aac ttc acc tec cag ate tec etc ace ggg ccc ggt gtg egg gtg cac 917 

Asn Phe Thr Ser Gin lie Ser Leu Thr Gly Pro Gly Val Arg Val His 

280 285 290 295 

tat ggc ttg tac aac cag teg gac ccc tgc cct gga gag ttc etc tgt 965 

Tyr Gly Leu Tyr Asn Gin Ser Asp Pro Cys Pro Gly Glu Phe Leu Cys 

300 305 310 

tct gtg aat gga etc tgt gtc cct gec tgt gat ggg gtc aag gac tgc 1013 

Ser Val Asn Gly Leu Cys Val Pro Ala Cys Asp Gly Val Lys Asp Cys 

315 320 325 

ccc aac ggc ctg gat gag aga aac tgc gtt tgc aga gee aca ttc cag 1061 

Pro Asn Gly Leu Asp Glu Arg Asn Cys Val Cys Arg Ala Thr Phe Gin 

330 335 340 

tgc aaa gag gac age aca tgc ate tea ctg ccc aag gtc tgt gat ggg 1109 

Cys Lys Glu Asp Ser Thr Cys lie Ser Leu Pro Lys Val Cys Asp Gly 

345 350 355 

cag cct gat tgt etc aac ggc age gac gaa gag cag tgc cag gaa ggg 1157 

Gin Pro Asp Cys Leu Asn Gly Ser Asp Glu Glu Gin Cys Gin Glu Gly 

360 365 370 375 

gtg cca tgt ggg aca ttc acc ttc cag tgt gag gac egg age tgc gtg 1205 
Val Pro Cys Gly Thr Phe Thr Phe Gin Cys Glu Asp Arg Ser Cys Val 

380 385 390 

aag aag ccc aac ccg cag tgt gat ggg egg ccc gac tgc agg gac ggc 1253 

Lys Lys Pro Asn Pro Gin Cys Asp Gly Arg Pro Asp Cys Arg Asp Gly 

395 400 405 

teg gat gag gag cac tgt gaa tgt ggc etc cag ggc ccc tec age cgc 1301 

Ser Asp Glu Glu His Cys Glu Cys Gly Leu Gin Gly Pro Ser Ser Arg 

410 415 420 



att gtt ggt gga get gtg tec tec gag ggt gag tgg cca tgg cag gec 
He Val Gly Gly Ala Val Ser Ser Glu Gly Glu Trp Pro Trp Gin Ala 
425 430 435 



1349 



age etc cag gtt egg ggt cga cac ate tgt ggg ggg gec etc ate get 13 97 

Ser Leu Gin Val Arg Gly Arg His He Cys Gly Gly Ala Leu He Ala 
440 445 450 455 

gac cgc tgg gtg at a aca get gec cac tgc ttc cag gag gac age atg 1445 

Asp Arg Trp Val He Thr Ala Ala His Cys Phe Gin Glu Asp Ser Met 

460 465 470 

gee tec acg gtg ctg tgg acc gtg ttc ctg ggc aag gtg tgg cag aac 1493 

Ala Ser Thr Val Leu Trp Thr Val Phe Leu Gly Lys Val Trp Gin Asn 

475 480 485 
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teg cgc tgg cct gga gag gtg tec ttc aag gtg age cgc ctg etc ctg 1541 
Ser Arg Trp Pro Gly Glu Val Ser Phe Lys Val Ser Arg Leu Leu Leu 
490 495 500 

cac ccg tac cac gaa gag gac age cat gac tac gac gtg gcg ctg ctg 1589 
His Pro Tyr His Glu Glu Asp Ser His Asp Tyr Asp Val Ala Leu Leu 
505 510 515 

cag etc gac cac ccg gtg gtg cgc teg gec gec gtg cgc ccc gtc tgc 1637 
Gin Leu Asp His Pro Val Val Arg Ser Ala Ala Val Arg Pro Val Cys 
520 525 530 535 

ctg ccc gcg cgc tec cac ttc ttc gag ccc ggc ctg cac tgc tgg att 1685 
Leu Pro Ala Arg Ser His Phe Phe Glu Pro Gly Leu His Cys Trp lie 

540 545 550 

a cg ggc tgg ggc gee ttg cgc gag ggc ggc ccc ate age aac get ctg 1733 
Thr Gly Trp Gly Ala Leu Arg Glu Gly Gly Pro lie Ser Asn Ala Leu 

555 560 565 

cag aaa gtg gat gtg cag ttg ate cca cag gac ctg tgc age gag gtc 1781 
Gin Lys Val Asp Val Gin Leu lie Pro Gin Asp Leu Cys Ser Glu Val 
570 575 580 

tat cgc tac cag gtg acg cca cgc atg ctg tgt gee ggc tac cgc aag 1829 
Tyr Arg Tyr Gin Val Thr Pro Arg Met Leu Cys Ala Gly Tyr Arg Lys 
585 590 595 

ggc aag aag gat gee tgt cag ggt gac tea ggt ggt ccg ctg gtg tgc 1877 
Gly Lys Lys Asp Ala Cys Gin Gly Asp Ser Gly Gly Pro Leu Val Cys 
600 605 610 615 

aag gca etc agt ggc cgc tgg ttc ctg gcg ggg ctg gtc age tgg ggc 1925 
Lys Ala Leu Ser Gly Arg Trp Phe Leu Ala Gly Leu Val Ser Trp Gly 

620 625 630 

c tg ggc tgt ggc egg cct aac tac ttc ggc gtc tac acc cgc ate aca 1973 
Leu Gly Cys Gly Arg Pro Asn Tyr Phe Gly Val Tyr Thr Arg lie Thr 

635 640 645 

ggt gtg ate age tgg ate cag caa gtg gtg acc tga ggaactgccc 2019 
Gly Val lie Ser Trp lie Gin Gin Val Val Thr * 
650 655 

ccctgcaaag cagggcccac ctcctggact cagagagccc agggcaactg ccaagcaggg 2079 

ggacaagtat tctggcgggg ggtgggggag agagcaggee ctgtggtggc aggaggggca 2139 

tcttgtttcg tccctgatgt ctgtccagta tggcaggagg atgagaagtg ccagcagttg 2199 

ggggtcaaga cgtcccttga ggacccaggc ccacacccag cccttttgcc tcccaattct 2259 

ctctcctccg tccccttcct ccactgctgc etaatgeaag gcagtggctc agcagcaaga 2319 

atgctggttc tacatcccga ggagtgtctg aggtgcgccc cactctgtac agaggctgtt 23 79 

tgggcagect tgcctccaga gagcagattc cagcttegga agcccctggt ctaacttggg 2439 

atctgggaat ggaaggtgct cccatcggag gggaccctca gagccctgga gaetgecagg 2499 

tgggectget gccactgtaa gecaaaaggt ggggaagtcc tgactccagg gtccttgccc 2559 

cacccctgcc tgccacctgg gccctcacag cccagaccct cactgggagg tgagctcagc 2619 

tgccctttgg aataaagctg ectgatgeaa aaaaaaaaaa aaaaaaaaaa aaa 2672 

<210> 10 

<211> 658 

<212> PRT 

<213> Homo Sapien 

<400> 10 
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Met Pro Val Ala Glu Ala Pro Gin Val Ala Gly Gly Gin Gly Asp Gly 

15 10 IS 

Gly Asp Gly Glu Glu Ala Glu Pro Glu Gly Met Phe Lys Ala Cys Glu 

20 25 30 

Asp Ser Lys Arg Lys Ala Arg Gly Tyr Leu Arg Leu Val Pro Leu Phe 

35 40 45 

Val Leu Leu Ala Leu Leu Val Leu Ala Ser Ala Gly Val Leu Leu Trp 

50 55 60 

Tyr Phe Leu Gly Tyr Lys Ala Glu Val Met Val Ser Gin Val Tyr Ser 
65 70 75 80 

Gly Ser Leu Arg Val Leu Asn Arg His Phe Ser Gin Asp Leu Thr Arg 

85 90 95 

Arg Glu Ser Ser Ala Phe Arg Ser Glu Thr Ala Lys Ala Gin Lys Met 

100 105 no 

Leu Lys Glu Leu lie Thr Ser Thr Arg Leu Gly Thr Tyr Tyr Asn Ser 

115 120 125 

Ser Ser Val Tyr Ser Phe Gly Val Tyr Gly Cys Ser Arg Gin Glu Pro 

130 135 140 

Val Val Glu Val Leu Ala Ser Gly Ala He Met Ala Val Val Trp Lys 
14S 150 155 160 

Lys Gly Leu His Ser Tyr Tyr Asp Pro Phe Val Leu Ser Val Gin Pro 

165 170 175 

Val Val Phe Gin Ala Cys Glu Val Asn Leu Thr Leu Asp Asn Arg Leu 

180 185 190 

Asp Ser Gin Gly Val Leu Ser Thr Pro Tyr Phe Pro Ser Tyr Tyr Ser 

195 200 205 

Pro Gin Thr His Cys Ser Trp His Leu Thr Val Pro Ser Leu Asp Tyr 

210 215 220 

Gly Leu Ala Leu Trp Phe Asp Ala Tyr Ala Leu Arg Arg Gin Lys Tyr 
225 230 235 240 

Asp Leu Pro Cys Thr Gin Gly Gin Trp Thr He Gin Asn Arg Arg Leu 

245 250 255 

Cys Gly Leu Arg lie Leu Gin Pro Tyr Ala Glu Arg He Pro Val Val 

260 265 270 

Ala Thr Ala Gly He Thr He Asn Phe Thr Ser Gin lie Ser Leu Thr 

275 280 285 

Gly Pro Gly Val Arg Val His Tyr Gly Leu Tyr Asn Gin Ser Asp Pro 

290 295 300 

Cys Pro Gly Glu Phe Leu Cys Ser Val Asn Gly Leu Cys Val Pro Ala 
305 310 315 320 

Cys Asp Gly Val Lys Asp Cys Pro Asn Gly Leu Asp Glu Arg Asn Cys 

325 330 335 

Val Cys Arg Ala Thr Phe Gin Cys Lys Glu Asp Ser Thr Cys He Ser 

340 345 350 

Leu Pro Lys Val Cys Asp Gly Gin Pro Asp Cys Leu Asn Gly Ser Asp 

355 360 365 

Glu Glu Gin Cys Gin Glu Gly Val Pro Cys Gly Thr Phe Thr Phe Gin 

370 375 380 

Cys. Glu Asp Arg Ser Cys Val Lys Lys Pro Asn Pro Gin Cys Asp Gly 
385 390 395 400 

Arg Pro Asp Cys Arg Asp Gly Ser Asp Glu Glu His Cys Glu Cys Gly 

405 410 415 

Leu Gin Gly Pro Ser Ser Arg He Val Gly Gly Ala Val Ser Ser Glu 

420 425 430 

Gly Glu Trp Pro Trp Gin Ala Ser Leu Gin Val Arg Gly Arg His He 

435 440 445 

Cys Gly Gly Ala Leu He Ala Asp Arg Trp Val He Thr Ala Ala His 
450 455 460 

Cys Phe Gin Glu Asp Ser Met Ala Ser Thr Val Leu Trp Thr Val Phe 
465 470 475 480 

Leu Gly Lys Val Trp Gin Asn Ser Arg Trp Pro Gly Glu Val Ser Phe 

485 490 495 
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535 
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550 
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Ser Glu Val Tyr 

Tyr Arg Lys Gly 
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Leu Val Cys Lys 
615 

Ser Trp Gly Leu 
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Ser 
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Ser 
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Cys 


Gin 


Gly 


Asp 


605 








Arg 


Trp 


Phe 


Leu 


Pro 


Asn 


Tyr 


Phe 
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<210> 11 

<211> 1656 

<212> DNA 

<213> Homo Sapien 

<220> 
<221> CDS 

<222> (268) . . . (1629) 

<223> DNA sequence encoding a transmembrane serine 
protease (MTSP-6) protein 

<400> 11 

cgcccgggca ggtcagtaac actgtggcct actatctctt ccgtggtgcc atctacattt 60 
ttgggactcg ggaattatga ctgtttttgg ttaatcgata ctgaatgcgc tttgtgtgga 120 
ctgtcgaatt tcaaagattt accgtatgac caagatgcac ctgatgctac aagtataaat 180 
aggggaacaa atgctttctg ttcttcctcg gctaaggagg tagaggtgga ggcggagccg 240 
gatgtcagag gtcctgaaat agtcacc atg ggg gaa aat gat ccg cct get gtt 294 

Met Gly Glu Asn Asp Pro Pro Ala Val 
1 5 

gaa gec ccc ttc tea ttc cga teg ctt ttt ggc ctt gat gat ttg aaa 342 
Glu Ala Pro Phe Ser Phe Arg Ser Leu Phe Gly Leu Asp Asp Leu Lys 
10 15 20 25 

ata agt cct gtt gca cca gat gca gat get gtt get gca cag ate ctg 390 
He Ser Pro Val Ala Pro Asp Ala Asp Ala Val Ala Ala Gin He Leu 

30 35 40 

tea ctg ctg cca ttg aag ttt ttt cca ate ate gtc att ggg ate att 43 8 

Ser Leu Leu Pro Leu Lys Phe Phe Pro He He Val He Gly He He 

45 50 55 

gca ttg ata tta gca ctg gee att ggt ctg ggc ate cac ttc gac tgc 486 
Ala Leu He Leu Ala Leu Ala He Gly Leu Gly He His Phe Asp Cys 
60 65 70 

tea ggg aag tac aga tgt cgc tea tec ttt aag tgt ate gag ctg ata 534 
Ser Gly Lys Tyr Arg Cys Arg Ser Ser Phe Lys Cys He Glu Leu He 
75 80 85 
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get cga tgt gac gga gtc teg gat tgc aaa gac ggg gag gac gag tac 
Ala Arg Cys Asp Gly Val Ser Asp Cys Lys Asp Gly Glu Asp Glu Tyr 
90 95 100 105 



582 



cgc tgt gtc egg gtg ggt ggt cag aat gee gtg etc cag gtg ttc aca 630 
Arg Cys Val Arg Val Gly Gly Gin Asn Ala Val Leu Gin Val Phe Thr 

110 lis 120 

get get teg tgg aag acc atg tgc tec gat gac tgg aag ggt cac tac 678 
Ala Ala Ser Trp Lys Thr Met Cys Ser Asp Asp Trp Lys Gly His Tyr 

125 130 135 

gca aat gtt gee tgt gee caa ctg ggt ttc cca age tat gta agt tea 726 
Ala Asn Val Ala Cys Ala Gin Leu Gly Phe Pro Ser Tyr Val Ser Ser 
140 145 150 

gat aac etc aga gtg age teg eta gag ggg cag ttc egg gag gag ttt 774 
Asp Asn Leu Arg Val Ser Ser Leu Glu Gly Gin Phe Arg Glu Glu Phe 
155 160 165 

gtg tec ate gat cac etc ttg cca gat gac aag gtg act gca tta cac 822 
val Ser lie Asp Has Leu Leu Pro Asp Asp Lys Val Thr Ala Leu His 
170 175 180 185 

cac tea gta tat gtg agg gag gga tgt gee tct ggc cac gtg gtt acc 870 
Hxs Ser Val Tyr Val Arg Glu Gly Cys Ala Ser Gly His Val Val Thr 

190 195 200 

ttg cag tgc aca gee tgt ggt cat aga agg ggc tac age tea cgc ate 918 
Leu Gin Cys Thr Ala Cys Gly His Arg Arg Gly Tyr Ser Ser Arg He 

205 210 215 

gtg ggt gga aac atg tec ttg etc teg cag tgg ccc tgg cag gec age 966 
Val Gly Gly Asn Met Ser Leu Leu Ser Gin Trp Pro Trp Gin Ala Ser 
220 225 230 

? tfc ~f 9 !£ c SI? 9 9 ? c tac cac ct9 tgc 99g ggc tct gtc ate acg ccc 1014 
Leu Gin Phe Gin Gly Tyr His Leu Cys Gly Gly Ser Val He Thr Pro 
235 240 245 

ctg tgg ate ate act get gca cac tgt gtt tat gac ttg tac etc ccc 1062 
Leu Trp He He Thr Ala Ala His Cys Val Tyr Asp Leu Tyr Leu Pro 
250 255 260 265 

aag tea tgg acc ate cag gtg ggt eta gtt tec ctg ttg gac aat cca 1110 
Lys Ser Trp Thr He Gin Val Gly Leu Val Ser Leu Leu Asp Asn Pro 

270 275 280 

gee cca tec cac ttg gtg gag aag att gtc tac cac age aag tac aaa 1158 
Ala Pro Ser Hxs Leu Val Glu Lys He Val Tyr His Ser Lys Tyr Lys 

285 290 295 

cca aag agg ctg ggc aat gac ate gee ctt atg aag ctg gee ggg cca 1206 
Pro Lys Arg Leu Gly Asn Asp He Ala Leu Met Lys Leu Ala Gly Pro 
300 305 310 

? tC S? 9 !£ C f at gaa at9 atc caa cct 9tg tgc ctg ccc aac tct gaa 1254 
Leu Thr Phe Asn Glu Met He Gin Pro Val Cys Leu Pro Asn Ser Glu 
3X5 320 325 

gag aac ttc ccc gat gga aaa gtg tgc tgg acg tea gga tgg ggg gec 1302 
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Glu Asn Phe Pro Asp Gly Lys Val Cys Trp Thr Ser Gly Trp Gly Ala 
330 335 340 345 

aca gag gat gga ggt gac gcc tec cct gtc ctg aac cac gcg gec gtc 1350 
Thr Glu Asp Gly Gly Asp Ala Ser Pro Val Leu Asn His Ala Ala Val 

350 355 360 

cct ttg att tec aac aag ate tgc aac cac agg gac gtg tac ggt ggc 1398 
Pro Leu lie Ser Asn Lys lie Cys Asn His Arg Asp Val Tyr Gly Gly 

365 370 375 

ate ate tec ccc tec atg etc tgc gcg ggc tac ctg acg ggt ggc gtg 1446 
lie lie Ser Pro Ser Met Leu Cys Ala Gly Tyr Leu Thr Gly Gly Val 
380 385 390 

* 

gac age tgc cag ggg gac age ggg ggg ccc ctg gtg tgt caa gag agg 1494 

Asp Ser Cys Gin Gly Asp Ser Gly Gly Pro Leu Val Cys Gin Glu Arg 

395 400 405 

agg ctg tgg aag tta gtg gga gcg ace age ttt ggc ate ggc tgc gca 1542 
Arg Leu Trp Lys Leu Val Gly Ala Thr Ser Phe Gly lie Gly Cys Ala 
410 415 420 425 

gag gtg aac aag cct ggg gtg tac ace cgt gtc ace tec ttc ctg gac 1590 
Glu Val Asn Lys Pro Gly Val Tyr Thr Arg Val Thr Ser Phe Leu Asp 

430 435 440 

tgg ate cac gag cag atg gag aga gac eta aaa ace tga agaggaaggg 1639 
Trp lie His Glu Gin Met Glu Arg Asp Leu Lys Thr * 

445 450 

gataagtagc cacetga . 1656 

<210> 12 

<211> 453 

<212> PRT 

<213> Homo Sapien 

<400> 12 

Met Gly Glu Asn Asp Pro Pro Ala Val Glu Ala Pro Phe Ser Phe Arg 

15 10 15 

Ser Leu Phe Gly Leu Asp Asp Leu Lys lie Ser Pro Val Ala Pro Asp 

20 25 30 

Ala Asp Ala Val Ala Ala Gin lie Leu Ser Leu Leu Pro Leu Lys Phe 

35 40 45 

Phe Pro lie lie Val lie Gly lie lie Ala Leu lie Leu Ala Leu Ala 

50 55 60 

lie Gly Leu Gly lie His Phe Asp Cys Ser Gly Lys Tyr Arg Cys Arg 
65 70 75 80 

Ser Ser Phe Lys Cys He Glu Leu He Ala Arg Cys Asp Gly Val Ser 

85 90 95 

Asp Cys Lys Asp Gly Glu Asp Glu Tyr Arg Cys Val Arg Val Gly Gly 

100 105 iio 

Gin Asn Ala Val Leu Gin Val Phe Thr Ala Ala Ser Trp Lys Thr Met 

115 120 125 

Cys Ser Asp Asp Trp Lys Gly His Tyr Ala Asn Val Ala Cys Ala Gin 

130 135 140 

Leu Gly Phe Pro Ser Tyr Val Ser Ser Asp Asn Leu Arg Val Ser Ser 
145 150 155 160 

Leu Glu Gly Gin Phe Arg Glu Glu Phe Val Ser He Asp His Leu Leu 

165 170 175 

Pro Asp Asp Lys Val Thr Ala Leu His His Ser Val Tyr Val Arg Glu 
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Gly His Val Val 

200 

Tyr Ser Ser Arg 
215 

Pro Trp Gin Ala 
230 

Ser Val He Thr 
245 

Asp Leu Tyr Leu 

Leu Leu Asp Asn 
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His Ser Lys Tyr 
295 

Lys Leu Ala Gly 
310 

Leu Pro Asn Ser 
325 

Ser Gly Trp Gly 

Asn His Ala Ala 

360 

Asp Val Tyr Gly 
375 

Leu Thr Gly Gly 
390 

Val Cys Gin Glu 
405 

Gly lie Gly Cys 

Thr Ser Phe Leu 

440 

Thr 



185 

Thr Leu Gin Cys 

He Val Gly Gly 

220 

Ser Leu Gin Phe 
235 

Pro Leu Trp He 
250 

Pro Lys Ser Trp 
265 
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300 
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315 
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<210> 13 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide Primer 

<221> misc_f eature 
<222> (0) ... (0) 
<223> N= Inosine 

<400> 13 

tggrtnvtnw sngcnrcnca ytg 23 

<210> 14 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide Primer 

<221> mi sc_f eature 
<222> (0) . . . (0) 
<223> N= Inosine 
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<400> 14 

nggnccnccn swrtcnccyt nrcanghrtc 

<210> 15 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer 
<400> 15 

tcaccgagaa gatgatgtgt gcaggcatcc 

<210> 16 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer 
<400> 16 

gggacagggg ctgtaaggca gggaatgag 

<210> 17 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer 
<400> 17 

cccgcagcca tagccccagc taacg 

<210> 18 
<211> 27 
<212> DNA 

<213> Aritificial Sequence 
<400> 18 

gcagacgatg cgtaccaggg ggaagtc 

<210> 19 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer 
<400> 19 

ctcgagaaaa gagtggtggg tggggaggag gcctctgtg 

<210> 20 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer 
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<400> 20 

gcggccgcat tacagctcag ccttccagac 30 

<210> 21 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide Primer 
<400> 21 

cctccacggt gctgtggacc gtgttcc 27 

<210> 22 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide Primer 
<400> 22 

cctcgcgcaa ggcgccccag cccg 24 

<210> 23 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide Primer 
<400> 23 

gcgtggcgtc acctggtagc gatagacctc gc 32 

<210> 24 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide Primer 
<400> 24 

cctccacggt gctgtggacc gtgttcc 27 

<210> 25 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide Primer 
<400> 25 

cctcgcgcaa ggcgccccag cccg 24 

<210> 26 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Oligonucleotide Primer 
<400> 26 

tcatcggcca gagggtgatc agtgag 

<210> 27 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide Primer 
<400> 27 

cctcctcagt gcataggcat caaaccag 

<210> 28 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide Primer 
<400> 28 

tctctcgaga aaagaattgt tggtggagct gtgtcctccg ag 

<210> 29 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide Primer 
<400> 29 

aggtgggcct tgctttgcag gggggcagtt c 

<210> 30 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide Primer 
<400> 30 

tcacgcatcg tgggtggaac atgt cc 

<210> 31 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide Primer 
<400> 31 

acccacctcc atctgctcgt ggatcc 
<210> 32 
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<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer 
<400> 32 

ccacagcctc ctctcttgac acaccag 27 

<210> 33 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer 
<400> 33 

acgcccctgt ggatcatcac tgctgc 26 

<210> 34 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer 
<400> 34 

tccctccctc acatatactg agtggtg 27 

<21'0> 35 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer 
<400> 35 

cgactgctca gggaagtcag atgtcg 26 

<210> 36 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer 
<400> 36 

gcggccgcac tataccccag tgttctcttt gatcca 36 

<210> 37 
<211> 27 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Oligonucleotide primer 
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<400> 37 

ctggtgtgtc aagagaggag gctgtgg 27 

<210> 38 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer 
<400> 38 

actcaggtgg ctacttatcc ccttcctc 28 

<210> 39 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide Primer 
<400> 39 

tctctcgaga aaagagtggt gggtggggag gaggcctctg tg 42 



<210> 40 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide Primer 
<400> 40 

attcgcggcc gcattacagc tcagccttcc agac 

<210> 41 
<211> 42 
<212> DNA 

<213> Artificial Sequence 



34 



<220> 

<223> Oligonucleotide Primer 
<400> 41 

tctctcgaga aaagaattgt tggtggagct gtgtcctccg ag 42 

<210> 42 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide Primer 
<400> 42 

attcgcggcc gctcaggtca ccacttgctg gatccag 37 

<210> 43 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Oligonucleotide Primer 
<400> 43 

ctcgagaaac gcatcgtggg tggaaacatg tccttg 36 

<210> 44 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide Primer 
<400> 44 

actcaggtgg ctacttatcc ccttcctc 2 8 

<210> 45 
<211> 9276 
<212> DNA 

<213> Pichia pastoris 
<400> 45 

agatctaaca tccaaagacg aaaggttgaa tgaaaccttt ttgccatccg acatccacag 60 

gtccattctc acacataagt gccaaacgca acaggagggg atacactagc agcagaccgt 120 

tgcaaacgca ggacctccac tcctcttctc ctcaacaccc acttttgcca tcgaaaaacc 180 

agcccagtta ttgggcttga ttggagctcg ctcattccaa ttccttctat taggctacta 240 

acaccatgac tttattagcc tgtctatcct ggcccccctg gcgaggttca tgtttgttta 300 

tttccgaatg caacaagctc cgcattacac ccgaacatca ctccagatga gggctttctg 360 

agtgtggggt caaatagttt catgttcccc aaatggccca aaactgacag tttaaacgct 420 

gtcttggaac ctaatatgac aaaagcgtga tctcatccaa gatgaactaa gtttggttcg 480 

ttgaaatgct aacggccagt tggtcaaaaa gaaacttcca aaagtcgcca taccgtttgt 54 0 

cttgtttggt attgattgac gaatgctcaa aaataatctc attaatgctt agcgcagtct 600 

ctctatcgct tctgaacccc ggtgcacctg tgccgaaacg caaatgggga aacacccgct 660 

ttttggatga ttatgcattg tctccacatt gtatgcttcc aagattctgg tgggaatact 720 

gctgatagcc taacgttcat gatcaaaatt taactgttct aacccctact tgacagcaat 780 

atataaacag aaggaagctg ccctgtctta aacctttttt tttatcatca ttattagctt 840 

actttcataa ttgcgactgg ttccaattga caagcttttg attttaacga cttttaacga 900 

caacttgaga agatcaaaaa acaactaatt attcgaagga tccaaacgat gagatttcct 960 

tcaattttta ctgcagtttt attcgcagca tcctccgcat tagctgctcc agtcaacact 1020 

acaacagaag atgaaacggc acaaattccg gctgaagctg tcatcggtta ctcagattta 1080 

gaaggggatt tcgatgttgc tgttttgcca ttttccaaca gcacaaataa cgggttattg 1140 

tttataaata ctactattgc cagcattgct gctaaagaag aaggggtatc tctcgagaaa 1200 

agagaggctg aagcttacgt agaattccct agggcggccg cgaattaatt cgccttagac 1260 

atgactgttc ctcagttcaa gttgggcact tacgagaaga ccggtcttgc tagattctaa 1320 

tcaagaggat gtcagaatgc catttgcctg agagatgcag gcttcatttt tgatactttt 1380 

ttatttgtaa cctatatagt ataggatttt ttttgtcatt ttgtttcttc tcgtacgagc 1440 

ttgctcctga tcagcctatc tcgcagctga tgaatatctt gtggtagggg tttgggaaaa 1500 

tcattcgagt ttgatgtttt tcttggtatt tcccactcct cttcagagta cagaagatta 1560 

agtgagaagt tcgtttgtgc aagcttatcg ataagcttta atgcggtagt ttatcacagt 1620 

taaattgcta acgcagtcag gcaccgtgta tgaaatctaa caatgcgctc atcgtcatcc 1680 

tcggcaccgt caccctggat gctgtaggca taggcttggt tatgccggta ctgccgggcc 174 0 

tcttgcggga tatcgtccat tccgacagca tcgccagtca ctatggcgtg ctgctagcgc 1800 

tatatgcgtt gatgcaattt ctatgcgcac ccgttctcgg agcactgtcc gaccgctttg 1860 

gccgccgccc agtcctgctc gcttcgctac ttggagccac tatcgactac gcgatcatgg 1920 

cgaccacacc cgtcctgtgg atctatcgaa tctaaatgta agttaaaatc tctaaataat 1980 

taaataagtc ccagtttctc catacgaacc ttaacagcat tgcggtgagc atctagacct 2040 

tcaacagcag ccagatccat cactgcttgg ccaatatgtt tcagtccctc aggagttacg 2100 

tcttgtgaag tgatgaactt ctggaaggtt gcagtgttaa ctccgctgta ttgacgggca 2160 

tatccgtacg ttggcaaagt gtggttggta ccggaggagt aatctccaca actctctgga 2220 

gagtaggcac caacaaacac agatccagcg tgttgtactt gatcaacata agaagaagca 2280 

ttctcgattt gcaggatcaa gtgttcagga gcgtactgat tggacatttc caaagcctgc 2340 
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tcgtaggttg caaccgatag ggttgtagag tgtgcaatac acttgcgtac aatttcaacc 2400 

cttggcaact gcacagcttg gttgtgaaca gcatcttcaa ttctggcaag ctccttgtct 2460 

gtcatatcga cagccaacag aatcacctgg gaatcaatac catgttcagc ttgagacaga 2520 

aggtctgagg caacgaaatc tggatcagcg tatttatcag caataactag aacttcagaa 2580 

ggcccagcag gcatgtcaat actacacagg gctgatgtgt cattttgaac catcatcttg 2640 

gcagcagtaa cgaactggtt tcctggacca aatattttgt cacacttagg aacagtttct 2700 

gttccgtaag ccatagcagc tactgcctgg gcgcctcctg ctagcacgat acacttagca 2760 

ccaaccttgt gggcaacgta gatgacttct ggggtaaggg taccatcctt cttaggtgga 2820 

gatgcaaaaa caatttcttt gcaaccagca actttggcag gaacacccag catcagggaa 2880 

gtggaaggca gaattgcggt tccaccagga atatagaggc caactttctc aataggtctt 294 0 

gcaaaacgag agcagactac accagggcaa gtctcaactt gcaacgtctc cgttagttga 3 000 

gcttcatgga atttcctgac gttatctata gagagatcaa tggctctctt aacgttatct 3060 

ggcaattgca taagttcctc tgggaaagga gcttctaaca caggtgtctt caaagcgact 3120 

ccatcaaact tggcagttag ttctaaaagg gctttgtcac cattttgacg aacattgtcg 3180 

acaattggtt tgactaattc cataatctgt tccgttttct ggataggacg acgaagggca 3240 

tcttcaattt cttgtgagga ggccttagaa acgtcaattt tgcacaattc aatacgacct ■ 3300 

tcagaaggga cttctttagg tttggattct tctttaggtt gttccttggt gtatcctggc 3360 

ttggcatctc ctttccttct agtgaccttt agggacttca tatccaggtt tctctccacc 3420 

tcgtccaacg tcacaccgta cttggcacat ctaactaatg caaaataaaa taagtcagca 3480 

cattcccagg ctatatcttc cttggattta gcttctgcaa gttcatcagc ttcctcccta 3540 

attttagcgt tcaacaaaac ttcgtcgtca aataaccgtt tggtataaga accttctgga . 3 600 

gcattgctct tacgatccca caaggtggct tccatggctc taagaccett tgattggcca ■ 3660 

aaacaggaag tgcgttccaa gtgacagaaa ccaacacctg tttgttcaac cacaaatttc 3720 

aagcagtctc catcacaatc caattcgata cccagcaact tttgagttgc tccagatgta 3 780 

gcacctttat accacaaacc gtgacgacga gattggtaga ctccagtttg tgtccttata 384 0 

gcctccggaa tagacttttt ggacgagtac accaggccca acgagtaatt agaagagtca 3900 

gccaccaaag tagtgaatag accatcgggg cggtcagtag tcaaagacgc caacaaaatt 3960 

tcactgacag ggaacttttt gacatcttca gaaagttcgt attcagtagt caattgccga 4020 

gcatcaataa tggggattat accagaagca acagtggaag tcacatctac caactttgcg 4080 

gtctcagaaa aagcataaac agttctacta ccgccattag tgaaactttt caaatcgccc 4140 

agtggagaag aaaaaggcac agcgatacta gcattagcgg gcaaggatgc aactttatca 42 00 

accagggtcc tatagataac cctagcgcct gggatcatcc tttggacaac tctttctgcc 4260 

aaatctaggt ccaaaatcac ttcattgata ccattattgt acaacttgag caagttgtcg 4320 • 

atcagctcct caaattggfcc ctctgtaacg gatgactcaa cttgcacatt aacttgaagc 4380 

tcagtcgatt gagtgaactt gatcaggttg tgcagctggt cagcagcata gggaaacacg 4440 
gcttttccta ccaaactcaa ggaattatca aactctgcaa cacttgcgta tgcaggtagc ' 4500- 

■aagggaaatg tcatacttga agtcggacag tgagtgtagt cttgagaaat tctgaagccg 4560 

tatttttatt atcagtgagt cagtcatcag gagatcctct acgccggacg catcgtggcc 4620 ■ 

gacctgcagg gggggggggg- gcgctgaggt ctgcctcgtg aagaaggtgt tgctgactca 4680 

taccaggcct gaatcgcccc atcatccagc cagaaagtga gggagccacg gttgatgaga 4740 

gctttgttgt aggtggacca gttggtgatt ttgaactttt gctttgccac ggaacggtct 4800 
gcgttgtcgg gaagatgcgt gatctgatcc ttcaactcag caaaagttcg atttattcaa ■ 4860 

■ caaagccgcc gtcccgtcaa gtcagcgtaa tgctctgcca gtgttacaac caattaacca 4920 

attctgatta gaaaaactca tcgagcatca aatgaaactg caatttattc atatcaggat 4980 

tatcaatacc atatttttga aaaagccgtt tctgtaatga aggagaaaac tcaccgaggc 5040 

agttccatag gatggcaaga tcctggtatc ggtctgcgat tccgactcgt ccaacatcaa 5100 

tacaacctat taatttcccc tcgtcaaaaa taaggttatc aagtgagaaa tcaccatgag 5160 

- tgacgactga atccggtgag aatggcaaaa gcttatgcat ttctttccag acttgttcaa 5220 

.caggccagcc attacgctcg tcatcaaaat cactcgcatc aaccaaaccg ttattcattc 5280 

gtgattgcgc ctgagcgaga cgaaatacgc gatcgctgtt aaaaggacaa ttacaaacag 5340 

gaatcgaatg caaccggcgc aggaacactg ccagcgcatc aacaatattt tcacctgaat 5400 

caggatattc ttctaatacc tggaatgctg ttttcccggg gatcgcagtg gtgagtaacc 5460 

atgcatcatc aggagtacgg ataaaatgct tgatggtcgg aagaggcata aattccgtca 5520 

gccagtttag tctgaccatc tcatctgtaa catcattggc aacgctacct ttgccatgtt 5580 

tcagaaacaa ctctggcgca tcgggcttcc catacaatcg atagattgtc gcacctgatt 564 0 

gcccgacatt atcgcgagcc catttatacc catataaatc agcatccatg ttggaattta 5700 

atcgcggcct cgagcaagac gtttcccgtt gaatatggct cataacaccc cttgtattac '5760 

tgtttatgta agcagacagt tttattgttc atgatgatat atttttatct tgtgcaatgt 5820 

aacatcagag attttgagac acaacgtggc tttccccccc ccccctgcag gtcggcatca 5880 

ccggcgccac aggtgcggtt gctggcgcct atatcgccga catcaccgat ggggaagatc 5940 

gggctcgcca cttcgggctc atgagcgctt gtttcggcgt gggtatggtg gcaggccccg 6000 

tggccggggg actgttgggc gccatctcct tgcatgcacc attccttgcg gcggcggtgc 6060 
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tcaacggcct caacctacta ctgggctgct tcctaatgca ggagtcgcat aagggagagc 6120 

gtcgagtatc tatgattgga agtatgggaa tggtgatacc cgcattcttc agtgtcttga 6180 

ggtctcctat cagattatgc ccaactaaag caaccggagg aggagatttc atggtaaatt 6240 

tctctgactt ttggtcatca gtagactcga actgtgagac tatctcggtt atgacagcag 6300 

aaatgtcctt cttggagaca gtaaatgaag tcccaccaat aaagaaatcc ttgttatcag 6360 

gaacaaactt cttgtttcga actttttcgg tgccttgaac tataaaatgt agagtggata 6420 

tgtcgggtag gaatggagcg ggcaaatgct taccttctgg accttcaaga ggtatgtagg 6480 

gtttgtagat actgatgcca acttcagtga caacgttgct atttcgttca aaccattccg 6540 

aatccagaga aatcaaagtt gtttgtctac tattgatcca agccagtgcg gtcttgaaac 6600 

tgacaatagt gtgctcgtgt tttgaggtca tctttgtatg aataaatcta gtctttgatc 6660 

taaataatct tgacgagcca aggcgataaa tacccaaatc taaaactctt ttaaaacgtt 6720 

aaaaggacaa gtatgtctgc ctgtattaaa ccccaaatca gctcgtagtc tgatcctcat 6780 

caacttgagg ggcactatct tgttttagag aaatttgcgg agatgcgata tcgagaaaaa .6840 

ggtacgctga ttttaaacgt gaaatttatc tcaagatctc tgcctcgcgc gtttcggtga 6900 

tgacggtgaa aacctctgac acatgcagct cccggagacg gtcacagctt gtctgtaagc 6960 

ggatgccggg agcagacaag cccgtcaggg cgcgtcagcg ggtgttggcg ggtgtcgggg .7020 

cgcagccatg acccagtcac gtagcgatag cggagtgtat actggcttaa ctatgcggca 7080 

tcagagcaga ttgtactgag agtgcaccat atgcggtgtg aaataccgca cagatgcgta ■ 7140 

aggagaaaat accgcatcag gcgctcttcc gcttcctcgc tcactgactc gctgcgctcg 7200 

gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg cggtaatacg gttatccaca 7260 

gaatcagggg ataacgcagg aaagaacatg tgagcaaaag gccagcaaaa ggccaggaac 7320 

cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc gcccccctga cgagcatcac 7380 

aaaaatcgac gctcaagtca gaggtggcga aacccgacag gactataaag ataccaggcg 7440 

tttccccctg gaagctccct cgtgcgctct cctgttccga ccctgccgct taccggatac 7500 

ctgtccgcct ttctcccttc gggaagcgtg gcgctttctc aatgctcacg ctgtaggtat 7560 

ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg tgcacgaacc ccccgttcag 7620 

cccgaccgct gcgccttatc cggtaactat cgtcttgagt ccaacccggt aagacacgac 7680 

ttatcgccac tggcagcagc cactggtaac aggattagca gagcgaggta tgtaggcggt 7740 

gctacagagt tcttgaagtg gtggcctaac tacggctaca ctagaaggac agtatttggt 7800 

atctgcgctc tgctgaagcc agttaccttc ggaaaaagag ttggtagctc ttgatccggc 7860 

aaacaaacca ccgctggtag cggtggtttt tttgtttgca agcagcagat tacgcgcaga 792 0 

. aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg ggtctgacgc tcagtggaac 7980 

gaaaactcac gttaagggat tttggtcatg agattatcaa aaaggatctt cacct agate 8040 

cttttaaatt aaaaatgaag ttttaaatca atctaaagta tatatgagta aacttggtct 8100 

• gacagttacc aatgcttaat cagtgaggca cctatctcag cgatctgtct atttcgttca 8160 

, . tccatagttg cctgactccc cgtcgtgtag ataactacga tacgggaggg cttaccatct 8220 

y ggccccagtg ctgeaatgat accgcgagac ccacgctcac cggctccaga tttatcagca 8280 

ataaaccagc cagceggaag ggccgagcgc agaagtggtc ctgcaacttt atccgcctcc 8340 

• • atccagtcta ttaattgttg cegggaaget agagtaagta gttcgccagt taatagtttg • 8400 

cgcaacgttg ttgccattgc tgeaggcate- gtggtgtcac getegtegtt tggtatggct 8460 

- tcattcagct ccggttccca acgatcaagg cgagttacat gatcccccat gttgtgcaaa 8520 

aaagcggtta gctccttcgg tcctccgatc gttgtcagaa gtaagttggc cgcagtgtta 8580 

tcactcatgg ttatggcagc actgeataat tctcttactg tcatgccatc cgtaagatgc 8640 

ttttctgtga ctggtgagta ctcaaccaag tcattctgag aatagtgtat gcggcgaccg 8700 

agttgctctt gcccggcgtc aacaegggat aataccgcgc cacatagcag aactttaaaa 8760 

gtgetcatea ttggaaaacg ttcttcgggg cgaaaactct caaggatctt accgctgttg 8820 

agatccagtt cgatgtaacc cactcgtgca cccaactgat cttcagcatc ttttactttc 8880 

accagcgttt ctgggtgagc aaaaacagga aggcaaaatg ccgcaaaaaa gggaataagg 8940 

gcgacacgga aatgttgaat actcatactc ttcctttttc aatattattg aagcatttat 9000 

cagggttatt gtctcatgag eggatacata tttgaatgta tttagaaaaa taaacaaata 9060 

9999ttccgc gcacatttcc ccgaaaagtg ccacctgacg tctaagaaac cattattatc 9120 

atgacattaa cctataaaaa taggegtate acgaggccct ttegtcttea agaattaatt 9180 

ctcatgtttg acagcttatc atcgataagc tgactcatgt tggtattgtg aaatagaege 9240 

agategggaa cactgaaaaa taacagttat tat teg 9276 

<210> 46 
<211> 3908 
<212> DNA 

<213> Escherichia coli 



<400> 46 

agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa tgcagctggc 



60 
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acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat gtgagttagc 120 

tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg ttgtgtggaa 180 

ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac gccaagcttg 240 

gtaccgagct cggatccact agtaacggcc gccagtgtgc tggaattcgc ccttaagggc 3 00 

gaattctgca gatatccatc acactggcgg ccgctcgagc atgcatctag agggcccaat 360 

tcgccctata gtgagtcgta ttacaattca ctggccgtcg ttttacaacg tcgtgactgg 420 

gaaaaccctg gcgttaccca acttaatcgc cttgcagcac atcccccttt cgccagctgg 4 80 

cgtaatagcg aagaggcccg caccgatcgc ccttcccaac agttgcgcag cctgaatggc 540 

gaatgggacg cgccctgtag cggcgcatta agcgcggcgg gtgtggtggt tacgcgcagc 600 

gtgaccgcta cacttgccag cgccctagcg cccgctcctt tcgctttctt cccttccttt 660 

ctcgccacgt tcgccggctt tccccgtcaa gctctaaatc gggggctccc tttagggttc 720 

cgatttagag ctttacggca cctcgaccgc aaaaaacttg atttgggtga tggttcacgt 7 80 

agtgggccat cgccctgata gacggttttt cgccctttga cgttggagtc cacgttcttt 840 

aatagtggac tcttgttcca aactggaaca acactcaacc ctatcgcggt ctattctttt 900 

- gatttataag ggattttgcc gatttcggcc tattggttaa aaaatgagct gatttaacaa 960 

attcagggcg caagggctgc taaaggaacc ggaacacgta gaaagccagt ccgcagaaac 1020 

ggtgctgacc ccggatgaat gtcagctact gggctatctg gacaagggaa aacgcaagcg 10 80 

caaagagaaa gcaggtagct tgcagtgggc ttacatggcg atagctagac tgggcggttt 1140 

tatggacagc aagcgaaccg gaattgccag ctggggcgcc . ctctggtaag gttgggaagc 1200 

cctgcaaagt aaactggatg gctttcttgc cgccaaggat ctgatggcgc aggggatcaa 1260 

gatctgatca agagacagga tgaggatcgt ttcgcatgat tgaacaagat ggattgcacg 1320 

caggttctcc ggccgcttgg gtggagaggc tattcggcta tgactgggca caacagacaa 1380 

tcggctgctc tgatgccgcc gtgttccggc tgtcagcgca ggggcgcccg gttctttttg 1440 

tcaagaccga cctgtccggt gccctgaatg aactgcagga cgaggcagcg cggctatcgt 1500 

ggctggccac gacgggcgtt ccttgcgcag ctgtgctcga cgttgtcact gaagcgggaa 1560 

gggactggct gctattgggc gaagtgccgg ggcaggatct cctgtcatct cgccttgctc 1620 

ctgccgagaa agtatccatc atggctgatg caatgcggcg gctgcatacg cttgatccgg 1680 

ctacctgccc attcgaccac caagcgaaac atcgcatcga gcgagcacgt actcggatgg 1740 

aagccggtct tgtcgatcag gatgatctgg acgaagagca tcaggggctc gcgccagccg " 1800 

aactgttcg^ caggctcaag gcgcgcatgc ccgacggcga ggatctcgtc gtgatccatg 1860 

gcgatgcctg cttgccgaat atcatggtgg aaaatggccg cttttctgga ttcaacgact 1920 

gtggccggct gggtgtggcg gaccgctatc aggacatagc gttggatacc cgtgatattg 1980 

ctgaagagct tggcggcgaa tgggctgacc gcttcctcgt gctttacggt atcgccgctc 2040 

ccgattcgca gcgcatcgcc ttctatcgcc ttcttgacga gttcttctga attgaaaaag 2100 

gaagagtatg agtattcaac atttccgtgt cgcccttatt cccttttttg cggcattttg 2160 

ccttcctgtt tttgctcacc cagaaacgct .ggtgaaagta aaagatgctg aagatcagtt ■ 2220 

gggtgcacga gtgggttaca tcgaactgga tctcaacagc ggtaagatcc ttgagagttt 2280 

tcgccccgaa gaacgttttc caatgatgag cacttttaaa gttctgctat gtcatacact 2340 

attatcccgt attgacgccg ggcaagagca actcggtcgc cgggcgcggt attctcagaa 2400 

tgacttggtt gagtactcac cagtcacaga aaagcatctt acggatggca tgacagtaag . 2460 

. agaattatgc agtgctgcca taaccatgag tgataacact gcggccaact tacttctgac 2520 

aacgatcgga ggaccgaagg agctaaccgc ttttttgcac aacatggggg atcatgtaac 2580 

tcgccttgat cgttgggaac cggagctgaa tgaagccata ccaaacgacg agagtgacac - 2640 

cacgatgcct gtagcaatgc caacaacgtt gcgcaaacta ttaactggcg aactacttac 2700 

tctagct.tcc cggcaacaat taatagactg gatggaggcg gataaagttg caggaccact 2760 

tctgcgctcg gcccttccgg ctggctggtt tattgctgat aaatctggag ccggtgagcg 2820 

tgggtctcgc ggtatcattg cagcactggg gccagatggt aagccctccc gtatcgtagt 2880 

tatctacacg acggggagtc aggcaactat ggatgaacga aatagacaga tcgctgagat 2940 

aggtgcctca ctgattaagc attggtaact gtcagaccaa gtttactcat atatacttta 3000 

gattgattta aaacttcatt tttaatttaa aaggatctag gtgaagatcc tttttgataa 3060 

tctcatgacc aaaatccctt aacgtgagtt ttcgttccac tgagcgtcag accccgtaga 3120 

aaagatcaaa ggatcttctt gagatccttt ttttctgcgc gtaatctgct gcttgcaaac 3180 

aaaaaaacca ccgctaccag cggtggtttg tttgccggat caagagctac caactctttt 3240 

tccgaaggta actggcttca gcagagcgca gataccaaat actgtccttc tagtgtagcc 3300 

gtagttaggc caccacttca agaactctgt agcaccgcct acatacctcg ctctgctaat 33 60 

cctgttacca gtggctgctg ccagtggcga taagtcgtgt cttaccgggt tggactcaag 3420 

acgatagtta ccggataagg cgcagcggtc gggctgaacg gggggttcgt gcacacagcc 3480 

cagcttggag cgaacgacct acaccgaact gagataccta cagcgtgagc attgagaaag 3540 

cgccacgctt cccgaaggga gaaaggcgga caggtatccg gtaagcggca gggtcggaac 3600 

aggagagcgc acgagggagc ttccaggggg aaacgcctgg tatctttata gtcctgtcgg 3660 

gtttcgccac ctctgacttg agcgtcgatt tttgtgatgc tcgtcagggg ggcggagcct 3720 

atggaaaaac gccagcaacg cggccttttt acggttcctg gccttttgct ggccttttgc 3780 
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tcacatgttc tttcctgcgt tatcccctga ttctgtggat aaccgtatta ccgcctttga 3840 
gtgagctgat accgctcgcc gcagccgaac gaccgagcgc agcgagtcag tgagcgagga 3900 
agcggaag 3908 

<210> 47 
<211> 46 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer 
<400> 47 

ggaattccat atgccgcgct ttaaagtggt gggtggggag gaggcc 46 

<210> 48 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer 
<400> 48 

cgcgataccc gttacagctc agccttccag ac 32 

<210> 49 

<211> 3147 

<212> DNA 

<213> Homo Sapien 

<220> 
<221> CDS 

<222> .(1865) ... (2590) 

<223> Nucleic acid sequence of protease domain of MTSP1 
<400> 49* . ....... 

tcaagagcgg cctcggggta ccatggggag cgatcgggcc cgcaagggcg gagggggccc 60 

gaaggacttc ggcgcgggac • tcaagtacaa ctcccggcac . gagaaagtga atggcttgga 120 

.ggaaggcgtg gagttcctgc cagtcaacaa cgtcaagaag gtggaaaagc atggcccggg 180 

gcgctgggtg gtgctggcag ccgtgctgat cggcctcctc ttggtcttgc tggggatcgg 240 

cttcctggtg tggcatttgc agtaccggga cgtgcgtgtc cagaaggtct tcaatggcta 300 

catgaggatc acaaatgaga attttgtgga tgcctacgag aactccaact ccactgagtt 360 

tgtaagcctg gccagcaagg tgaaggacgc gctgaagctg ctgtacagcg gagtcccatt- 420 

cctgggcccc taccacaagg agtcggctgt gacggccttc agcgagggca gcgtcatcgc 480 

ctactactgg tctgagttca gcatcccgca gcacctggtg gaggaggccg - agcgcgtcat 540 

ggccgaggag cgcgtagtca tgctgccccc gcgggcgcgc tccctgaagt cctttgtggt 600 

cacctcagtg gtggctttcc ccacggactc caaaacagta cagaggaccc aggacaacag 660 

ctgcagcttt ggcctgcacg cccgcggtgt ggagctgatg cgctt caeca cgcccggctt 720 

ccctgacagc ccctaccccg ctcatgcccg ctgccagtgg gccctgcggg gggaegcega 780 

ctcagtgctg agcctcacct tccgcagctt tgaccttgcg tcctgcgacg agegeggcag 840 

cgacctggtg acggtgtaca acaccctgag ccccatggag ccccacgccc tggtgcagtt 900 

gtgtggcacc taccctccct cctacaacct gaccttccac tcctcccaga acgtcctgct 960 

catcacactg ataaccaaca ctgagcggcg gcatcccggc tttgaggeca ccttcttcca 1020 

getgectagg atgagcagct gtggaggccg ettaegtaaa geccagggga cattcaacag 1080 

cccctactac • ccaggccact acccacccaa cattgactgc acatggaaca ttgaggtgcc 1140 

caacaaccag catgtgaagg tgagcttcaa attcttctac ctgctggagc ccggcgtgcc 1200 

tgcgggcacc tgccccaagg actacgtgga gatcaatggg gagaaatact geggagagag 1260 

gtcccagttc gtcgt caeca gcaacagcaa caagatcaca gttcgcttcc actcagatca 132 0 

gtcctacacc gacaccggct tcttagctga atacctctcc tacgactcca gtgacccatg 1380 

cccggggcag ttcacgtgcc geaeggggeg gtgtatccgg aaggagctgc gctgtgatgg 1440 

ctgggccgac tgcaccgacc acagegatga gctcaactgc agttgcgacg ccggccacca 150 0 
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gttcacgtgc aagaacaagt tctgcaagcc cctcttctgg gtctgcgaca gtgtgaacga 1560 

ctgcggagac aacagcgacg agcaggggtg cagttgtccg gcccagacct tcaggtgttc 1620 

caatgggaag tgcctctcga aaagccagca gtgcaatggg aaggacgact gtggggacgg 1680 

gtccgacgag gcctcctgcc ccaaggtgaa cgtcgtcact tgtaccaaac acacctaccg 1740 

ctgcctcaat gggctctgct tgagcaaggg caaccctgag tgtgacggga aggaggactg 1800 

tagcgacggc tcagatgaga aggactgcga ctgtgggctg cggtcattca cgagacaggc 1860 

tcgt gtt gtt ggg ggc acg gat gcg gat gag ggc gag tgg ccc tgg cag 1909 
Val Val Gly Gly Thr Asp Ala Asp Glu Gly Glu Trp Pro Trp Gin 
1 5 10 15 

gta age ctg cat get ctg ggc cag ggc cac ate tgc ggt get tec etc 1957 
Val Ser Leu His Ala Leu Gly Gin Gly His lie Cys Gly Ala Ser Leu 

20 25 30 

ate tct ccc aac tgg ctg gtc tct gec gca cac tgc tac ate gat gac 2005 
lie Ser Pro Asn Trp Leu Val Ser Ala Ala His Cys Tyr lie Asp Asp 

35 40 45 

aga gga ttc agg tac tea gac ccc acg cag tgg acg gec ttc ctg ggc 2053 
Arg Gly Phe Arg Tyr Ser Asp Pro Thr Gin Trp Thr Ala Phe Leu Gly 
50 55 60 

ttg cac gac cag age cag cgc age gec cct ggg gtg cag gag cgc agg 2101 
Leu His Asp Gin Ser Gin Arg Ser Ala Pro Gly Val Gin Glu Arg Arg 
65 70 75 

etc aag cgc ate ate tec cac ccc ttc ttc aat gac ttc ace ttc gac 2149 
Leu Lys Arg lie lie Ser His Pro Phe Phe Asn Asp Phe Thr Phe Asp 
80 85 90 95 

tat gac ate gcg ctg ctg gag ctg gag aaa ccg gca gag tac age tec 2197 
Tyr Asp lie Ala Leu Leu Glu Leu Glu Lys Pro Ala Glu Tyr Ser Ser 

100 105 110 

atg gtg egg ccc ate tgc ctg ccg gac gec tec cat gtc ttc cct gec 2245 
Met Val Arg Pro lie Cys Leu Pro Asp Ala Ser His Val Phe Pro Ala 

115 120 125 

ggc aag gee ate tgg gtc acg ggc tgg gga cac acc cag tat gga ggc 2293 
Gly Lys Ala lie Trp Val Thr Gly Trp Gly His Thr Gin Tyr Gly Gly 
130 135 140 

act ggc gcg ctg ate ctg caa aag ggt gag ate cgc gtc ate aac cag 2341 
Thr Gly Ala Leu lie Leu Gin Lys Gly Glu lie Arg Val lie Asn Gin 
145 150 155 

acc acc tgc gag aac etc ctg ccg cag cag ate acg ccg cgc atg atg 2389 
Thr Thr Cys Glu Asn Leu Leu Pro Gin Gin lie Thr Pro Arg Met Met 
160 165 170 175 

tgc gtg ggc ttc etc age ggc ggc gtg gac tec tgc cag ggt gat tec 2437 
Cys Val Gly Phe Leu Ser Gly Gly Val Asp Ser Cys Gin Gly Asp Ser 

180 185 190 

BBS 99 a cc c ctg tec age gtg gag gcg gat ggg egg ate ttc cag gec 2485 
Gly Gly Pro Leu Ser Ser Val Glu Ala Asp Gly Arg lie Phe Gin Ala 

195 200 205 

ggt gtg gtg age tgg gga gac ggc tgc get cag agg aac aag cca ggc 2533 
Gly Val Val Ser Trp Gly Asp Gly Cys Ala Gin Arg Asn Lys Pro Gly 
210 215 220 
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gtg tac aca agg etc cct ctg ttt egg gac tgg ate aaa gag aac act 2581 
Val Tyr Thr Arg Leu Pro Leu Phe Arg Asp Trp lie Lys Glu Asn Thr 
225 230 235 

ggg gta tag gggcegggge cacccaaatg tgtacacctg cggggccacc 2630 

Gly Val * 

240 

catcgtccac cccagtgtgc acgcctgcag gctggagact ggaccgctga ctgcaccagc 2690 

gcccccagaa catacactgt gaactcaatc tccagggctc caaatctgcc tagaaaacct 2750 

ctcgcttcct cagcctccaa agtggagctg ggaggtagaa ggggaggaca ctggtggttc 2810 

tactgaccca actgggggca aaggtttgaa gacacagcct cccccgccag ccccaagctg 2 870 

ggccgaggcg cgtttgtgta tatctgcctc ccctgtctgt aaggagcagc gggaaeggag 2930 

cttcggagcc tcctcagtga aggtggtggg getgeeggat ctgggctgtg gggcccttgg 2990 

gccacgctct tgaggaagee caggctegga ggaccctgga aaacagaegg gtctgagact 3 050 

gaaattgttt taccagctcc cagggtggac ttcagtgtgt gtatttgtgt aaatgggtaa 3110 

aacaatttat ttctttttaa aaaaaaaaaa aaaaaaa 3147 



<210> 50 

<211> 241 

<212> PRT 

<213> Homo Sapien 



<400> 50 



Val 


Val 


Gly Gly 


Thr 


Asp 


Ala Asp 


Glu 


Gly Glu 


Trp 


Pro 


Trp 


Gin 


Val 


1 








5 








10 










15 




Ser 


Leu 


His 


Ala 


Leu 


Gly 


Gin Gly His 


He 


Cys 


Gly 


Ala 


Ser 


Leu 


He 








20 








25 










30 






Ser 


Pro 


Asn 


Trp 


Leu 


Val 


Ser Ala 


Ala 


His 


Cys 


Tyr 


He 


Asp 


Asp 


Arg 






35 






40 










45 








Gly 


Phe 


Arg 


Tyr 


Ser Asp 


Pro Thr 


Gin 


Trp 


Thr 


Ala 


Phe 


Leu 


Gly 


Leu 


50 










55 








60 










His 


Asp 


Gin 


Ser 


Gin 


Arg 


Ser Ala 


Pro 


Gly Val 


Gin 


Glu 


Arg 


Arg 


Leu 


65 








70 








75 










80 


Lys 


Arg 


He 


He 


Ser 


His 


Pro Phe 


Phe 


Asn 


Asp 


Phe 


Thr 


Phe 


Asp 


Tyr 






85 








90 










95 




Asp 


lie 


Ala 


Leu 


Leu 


Glu 


Leu Glu 


Lys 


Pro 


Ala 


Glu 


Tyr 


Ser 


Ser 


Met 






100 








105 










110 






Val 


Arg 


Pro 


He 


Cys 


Leu 


Pro Asp 


Ala 


Ser 


His 


Val 


Phe 


Pro 


Ala 


Gly 




115 






120 










125 








Lys 


Ala 


He 


Trp 


Val 


Thr 


Gly Trp 


Gly 


His 


Thr 


Gin 


Tyr 


Gly Gly 


Thr 


130 








135 








140 










Gly 


Ala 


Leu 


He 


Leu 


Gin 


Lys Gly 


Glu 


He 


Arg 


Val 


He 


Asn 


Gin 


Thr 


145 










150 






155 










160 


Thr 


Cys 


Glu 


Asn 


Leu 


Leu 


Pro Gin 


Gin 


He 


Thr 


Pro 


Arg 


Met 


Met 


Cys 








165 








170 










175 




Val 


Gly 


Phe 


Leu 


Ser 


Gly 


Gly Val 


Asp 


Ser 


Cys 


Gin 


Gly 


Asp 


Ser 


Gly 






180 








185 










190 






Gly 


Pro 


Leu 


Ser 


Ser 


Val 


Glu Ala 


Asp 


Gly Arg 


He 


Phe 


Gin 


Ala 


Gly 




195 








200 










205 








Val 


Val 


Ser 


Trp 


Gly Asp 


Gly Cys 


Ala 


Gin 


Arg 


Asn 


Lys 


Pro 


Gly 


Val 




210 










215 








220 










Tyr 


Thr 


Arg 


Leu 


Pro 


lieu 


Phe Arg 


Asp 


Trp 


He 


Lys 


Glu 


Asn 


Thr 


Gly 


225 








230 








235 










240 



Val 



<210> 51 

<211> 46 

<212> DNA 

<213> Artificial 



Sequence 
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<220> 

<223> Oligonucleoide Primer 
<400> 51 

tctctcgaga aaagagtggt gggtgggtgg ggaggaggcc tctgtg 

<210> 52 
<211> 43 
<212> DNA 

<213> Aritificial sequence 
<400> 52 

gctcctcatc aaagaagggc agagagatgg gcctgactgt gcc 

<210> 53 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleoide Primer 
<400> 53 

attcgcggcc gcattacagc tcagccttcc agac 

<210> 54 
<211> 43 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleoide Primer 
<400> 54 

ggcacagtca ggcccatctc tctgcccttc tttgatgagg age 

<210> 55 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleoide Primer 
<400> 55 

caccccttct tcaatgactt caccttcg 

<210> 56 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleoide Primer 
<400> 56 

tacctctcct acgactcc 

<210> 57 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Oligonucleoide Primer 
<400> 57 

gaggttctcg caggtggtct ggttg 25 

<210> 58 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleoide Primer 
<400> 58 

ctcgagaaaa gagttgttgg gggcacggat gcggatgag 3 9 

<210> 59 

<211> 11 

<212> PRT 

<213> Homo Sapien 

<400> 59 

Phe Glu Val Phe Ser Gin Ser Ser Ser Leu Gly 
15 10 

<210> 60 

<211> 32 

<212> PRT 

<213> Homo Sapien 

<400> 60 

Glu lie Val Ala Pro Arg Glu Arg Ala Asp Arg Arg Gly Arg Lys Leu 

15 10 15 

Leu Cys Trp Arg Lys Pro Thr Lys Met Lys Gly Pro Arg Pro Ser His 

20 25 30 

<210> 61 

<211> 4933 

<212> DNA 

<213> Homo Sapien 

<220> 
<221> CDS 

<222> (94) . , . (3222) 

<223> Nucleotide sequence encoding corin 
<300> 

<308> GenBank AF133845 
<309> 1999-05-24 

<400> 61 

aaatcatccg tagtgcctcc ccgggggaca cgtagaggag agaaaagcga ccaagataaa 60 
agtggacaga agaataagcg agacttttta tec atg aaa cag tct cct gcc etc 114 

Met Lys Gin Ser Pro Ala Leu 
1 5 

get ccg gaa gag cgc tac cgc aga gcc ggg tec cca aag ccg gtc ttg 162 
Ala Pro Glu Glu Arg Tyr Arg Arg Ala Gly Ser Pro Lys Pro Val Leu 
10 15 20 

aga get gat gac aat aac atg ggc aat ggc tgc tct cag aag ctg gcg 210 
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Arg Ala Asp Asp Asn Asn Met Gly Asn Gly Cys Ser Gin Lys Leu Ala 

25 30 35 

act get aac etc etc egg ttc eta ttg ctg gtc ctg att cca tgt ate 258 

Thr Ala Asn Leu Leu Arg Phe Leu Leu Leu Val Leu lie Pro Cys lie 

40 45 50 55 

tgt get etc gtt etc ttg ctg gtg ate ctg ctt tec tat gtt gga aca 306 

Cys Ala Leu Val Leu Leu Leu Val lie Leu Leu Ser Tyr Val Gly Thr 

60 65 70 

tta caa aag gtc tat ttt aaa tea aat ggg agt gaa cct ttg gtc act 354 

Leu Gin Lys Val Tyr Phe Lys Ser. Asn Gly Ser Glu Pro Leu Val Thr 

75 80 85 

gat ggt gaa ate caa ggg tec gat gtt att ctt aca aat aca att tat 402 

Asp Gly Glu lie Gin Gly Ser Asp Val lie Leu Thr Asn Thr lie Tyr 

90 95 100 

aac cag age act gtg gtg tct act gca cat ccc gac caa cac gtt cca 450 

Asn Gin Ser Thr Val Val Ser Thr Ala His Pro Asp Gin His Val Pro 

105 110 115 

gec tgg act acg gat get tct etc cca ggg gac caa agt cac agg aat 498 

Ala Trp Thr Thr Asp Ala Ser Leu Pro Gly Asp Gin Ser His Arg Asn 

120 125 130 135 

aca agt gee tgt atg aac ate acc cac age cag tgt cag atg ctg ccc 546 

Thr Ser Ala Cys Met Asn lie Thr His Ser Gin Cys Gin Met Leu Pro 

140 145 150 

tac cac gec acg ctg aca cct etc etc tea gtt gtc aga aac atg gaa 594 

Tyr His Ala Thr Leu Thr Pro Leu Leu Ser Val Val Arg Asn Met Glu 

155 160 165 

atg gaa aag ttc etc aag ttt ttc aca tat etc cat cgc etc agt tgc 642 

Met Glu Lys Phe Leu Lys Phe Phe Thr Tyr Leu His Arg Leu Ser Cys 

170 175 180 

tat caa cat ate atg ctg ttt ggc tgt acc etc gee ttc cct gag tgc 690 

Tyr Gin His lie Met Leu Phe Gly Cys Thr Leu Ala Phe Pro Glu Cys 

185 190 195 

ate att gat ggc gat gac -agt cat gga etc ctg ccc tgt agg tec ttc 738 

lie lie Asp Gly Asp Asp Ser His Gly Leu Leu Pro Cys Arg Ser Phe 

200 205 210 215 

tgt gag get gca aaa gaa ggc tgt gaa tea gtc ctg ggg atg gtg aat 786 

Cys Glu Ala Ala Lys Glu Gly Cys Glu Ser Val Leu Gly Met Val Asn 

220 225 230 

tac tec tgg ccg gat ttc etc aga tgc tec cag ttt aga aac caa act 834 

Tyr Ser Trp Pro Asp Phe Leu Arg Cys Ser Gin Phe Arg Asn Gin Thr 

235 240 245 

gaa age age aat gtc age aga att tgc ttc tea cct cag cag gaa aac 882 

Glu Ser Ser Asn Val Ser Arg lie Cys Phe Ser Pro Gin Gin Glu Asn 

250 255 260 

gga aag caa ttg etc tgt gga agg ggt gag aac ttt ctg tgt gee agt 930 

Gly Lys Gin Leu Leu Cys Gly Arg Gly Glu Asn Phe Leu Cys Ala Ser 

265 270 275 
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gga ate tgc ate ccc ggg aaa ctg caa tgt aat ggc tac aac gac tgt 97 8 

Gly lie Cys lie Pro Gly Lys lieu Gin Cys Asn Gly Tyr Asn Asp Cys 

280 285 290 295 

gac gac tgg agt gac gag get cat tgc aac tgc age gag aat ctgttt 1026 

Asp Asp Trp Ser Asp Glu Ala His Cys Asn Cys Ser Glu Asn Leu Phe 

300 305 310 

cac tgt cac aca ggc aag tgc ctt aat tac age ctt gtg tgt gat gga 1074 

His Cys His Thr Gly Lys Cys Leu Asn Tyr Ser Leu Val Cys Asp Gly 

315 320 325 

tat gat gac tgt ggg gat ttg agt gat gag caa aac tgt gat tgc aat 1122 

Tyr Asp Asp Cys Gly Asp Leu Ser Asp Glu Gin Asn Cys Asp Cys Asn 
330 335 340 

ccc aca aca gag cat cgc tgc ggg gac ggg cgc tgc ate gec atg gag 1170 

Pro #hr Thr Glu His Arg Cys Gly Asp Gly Arg Cys lie Ala Met Glu 
345 350 355 

tgg gtg tgt gat ggt gac cac gac tgt gtg gat aag tec gac gag gtc 1218 

Trp Val Cys Asp Gly Asp His Asp Cys Val Asp Lys Ser Asp Glu Val 

360 365 370 375 

aac tgc tec tgt cac age cag ggt ctg gtg gaa tgc aga aat gga caa 1266 

Asn Cys Ser Cys His Ser Gin Gly Leu Val Glu Cys Arg Asn Gly Gin 

380 385 390 

tgt ate ccc age acg ttt caa tgt gat ggt gac gag gac tgc aag gat 1314 

Cys lie Pro Ser Thr Phe Gin Cys Asp Gly Asp Glu Asp Cys Lys Asp 

395 400 405 

999 a 9t 9 a t gag gag aac tgc age gtc att cag act tea tgt caa gaa 1362 

Gly Ser Asp Glu Glu Asn Cys Ser Val lie Gin Thr Ser Cys Gin Glu 
410 415 420 

gga gac caa aga tgc etc tac aat ccc tgc ctt gat tea tgt ggt ggt 1410 

Gly Asp Gin Arg Cys Leu Tyr Asn Pro Cys Leu Asp Ser Cys Gly Gly 
425 430 435 

age tct etc tgt gac ccg aac aac agt ctg aat aac tgt agt caa tgt 1458 

Ser Ser Leu Cys Asp Pro Asn Asn Ser Leu Asn Asn Cys Ser Gin Cys 

440 445 450 455 

gaa cca att aca ttg gaa etc tgc atg aat ttg ccc tac aac agt aca 1506 

Glu Pro lie Thr Leu Glu Leu Cys Met Asn Leu Pro Tyr Asn Ser Thr 

460 465 470 

agt tat cca aat tat ttt ggc cac agg act caa aag gaa gca tec ate 1554 

Ser Tyr Pro Asn Tyr Phe Gly His Arg Thr Gin Lys Glu Ala Ser lie 

475 480 485 

age tgg gag tct tct ctt ttc cct gca ctt gtt caa ace aac tgt tat 1602 

Ser Trp Glu Ser Ser Leu Phe Pro Ala Leu Val Gin Thr Asn Cys Tyr 
490 495 500 

aaa tac etc atg ttc ttt tct tgc acc att ttg gta cca aaa tgt gat 1650 

Lys Tyr Leu Met Phe Phe Ser Cys Thr lie Leu Val Pro Lys Cys Asp 
505 510 515 

gtg aat aca ggc gag cgt ate cct cct tgc agg gca ttg tgt gaa cac 1698 
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Val Asn Thr Gly Glu Arg lie Pro Pro Cys Arg Ala Leu Cys Glu His 
520 525 530 535 

tct aaa gaa cgc tgt gag tct gtt ctt ggg att gtg ggc eta cag tgg 1746 

Ser Lys Glu Arg Cys Glu Ser Val Leu Gly lie Val Gly Leu Gin Trp 

540 545 550 

cct gaa gac aca gat tgc agt caa ttt cca gag gaa aat tea gae aat 1794 

Pro Glu Asp Thr Asp Cys Ser Gin Phe Pro Glu Glu Asn Ser Asp Asn 

555 560 565 

caa acc tgc ctg. atg cct gat gaa tat gtg gaa gaa tgc tea cct agt 1842 

Gin Thr Cys Leu Met Pro Asp Glu Tyr Val Glu Glu CyB Ser Pro Ser 

570 575 580 

cat ttc aag tgc cgc tea gga cag tgt gtt ctg get tec aga aga tgt 1890 

His Phe Lys Cys Arg Ser Gly Gin Cys Val Leu Ala Ser Arg Arg Cys 
585 590 595 

gat ggc cag gee gac tgt gac gat gac agt gat gag gaa aac tgt ggt 1938 

Asp Gly Gin Ala Asp Cys Asp Asp Asp Ser Asp Glu Glu Asn Cys Gly 
600 605 610 615 

tgt aaa gag aga gat ctt tgg gaa tgt cca tec aat aaa caa tgt ttg 1986 

Cys Lys Glu Arg Asp Leu Trp Glu Cys Pro Ser Asn Lys Gin Cys Leu 

620 625 630 

aag cac aca gtg ate tgc gat ggg ttc cca gac tgc cct gat tac atg 2034 

Lys His Thr Val lie Cys Asp Gly Phe Pro Asp Cys Pro Asp Tyr Met 

635 640 645 



gac gag aaa aac tgc tea ttt tgc caa gat gat gag ctg gaa tgt gca 
Asp Glu Lys Asn Cys Ser Phe Cys Gin Asp Asp Glu Leu Glu Cys Ala 
650 655 660 



2082 



aac cat gcg tgt gtg tea cgt gac ctg tgg tgt gat ggt gaa gee gac 2130 
Asn His Ala Cys Val Ser Arg Asp Leu Trp Cys Asp Gly Glu Ala Asp 

665 670 675 

tgc tea gac agt tea gat gaa tgg gac tgt gtg acc etc tct ata aat 2178 

Cys Ser Asp Ser Ser Asp Glu Trp Asp Cys Val Thr Leu Ser lie Asn 

680 685 690 695 

gtg aac tec tct tec ttt ctg atg gtt cac aga get gee aca gaa cac 2226 

Val Asn Ser Ser Ser Phe Leu Met Val His Arg Ala Ala Thr Glu His 

700 705 710 

cat gtg tgt gca gat ggc tgg cag gag ata ttg agt cag ctg gee tgc 2274 

His Val Cys Ala Asp Gly Trp Gin Glu lie Leu Ser Gin Leu Ala Cys 

715 720 725 

aag cag atg ggt tta gga gaa cca tct gtg acc aaa ttg ata cag gaa 2322 

Lys Gin Met Gly Leu Gly Glu Pro Ser Val Thr Lys Leu lie Gin Glu 

730 735 740 

cag gag aaa gag ccg egg tgg ctg aca tta cac tec aac tgg gag age 2370 

Gin Glu Lys Glu Pro Arg Trp Leu Thr Leu His Ser Asn Trp Glu Ser 

745 750 755 

etc aat ggg acc act tta cat gaa ctt eta gta aat ggg cag tct tgt 2418 

Leu Asn Gly Thr Thr Leu His Giu Leu Leu Val Asn Gly Gin Ser Cys 

760 765 770 775 
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gag age aga agt aaa att tct ctt ctg tgt act aaa caa gac tgt ggg 2466 
Glu Ser Arg Ser Lys lie Ser Leu Leu Cys Thr Lys Gin Asp Cys Gly 

780 v 785 790 

cgc cgc cct get gec cga atg aac aaa agg ate ctt gga ggt egg acg 2514 
Arg Arg Pro Ala Ala Arg Met Asn Lys Arg lie Leu Gly Gly Arg Thr 

795 800 805 

agt cgc cct gga agg tgg cca tgg cag tgt tct ctg cag agt gaa ccc 2562 
Ser Arg Pro Gly Arg Trp Pro Trp Gin Cys Ser Leu Gin Ser Glu Pro 
810 815 820 

agt gga cat ate tgt ggc tgt gtc etc att gec aag aag tgg gtt ctg 2610 
Ser Gly His lie Cys Gly Cys Val Leu lie Ala Lys Lys Trp Val Leu 
825 830 835 

aca gtt gec cac tgc ttc gag ggg aga gag aat get gca gtt tgg aaa 2658 
Thr Val Ala His Cys Phe Glu Gly Arg Glu Asn Ala Ala Val Trp Lys 
840 845 850 855 

gtg gtg ctt ggc ate aac aat eta gac cat cca tea gtg ttc atg cag 2706 
Val Val Leu Gly lie Asn Asn Leu Asp His Pro Ser Val Phe Met Gin 

860 865 870 

aca cgc ttt gtg aag acc ate ate ctg cat ccc cgc tac agt cga gca 2754 
Thr Arg Phe Val Lys Thr lie lie Leu His Pro Arg Tyr Ser Arg Ala 

875 880 885 

gtg gtg gac tat gac ate age ate gtt gag ctg agt gaa gac ate agt 2802 
Val Val Asp Tyr Asp lie Ser lie Val Glu Leu Ser Glu Asp lie Ser 
890 895 900 

gag act ggc tac gtc egg cct gtc tgc ttg ccc aac ccg gag cag tgg 2850 
Glu Thr Gly Tyr Val Arg Pro Val Cys Leu Pro Asn Pro Glu Gin Trp 
905 910 915 

eta gag cct gac acg tac tgc tat ate aca ggc tgg ggc cac atg ggc 2898 
Leu Glu Pro Asp Thr Tyr Cys Tyr lie Thr Gly Trp Gly His Met Gly 
920 925 930 935 

aat aaa atg cca ttt aag ctg caa gag gga gag gtc cgc att att tct 2946 
Asn Lys Met Pro Phe Lys Leu Gin Glu Gly Glu Val Arg lie lie Ser 

940 945 950 

ctg gaa cat tgt cag tec tac ttt gac atg aag acc ate acc act egg 2994 
Leu Glu His Cys Gin Ser Tyr Phe Asp Met Lys Thr lie Thr Thr Arg 

955 960 965 

atg ata tgt get ggc tat gag tct ggc aca gtt gat tea tgc atg ggt 3042 
Met lie Cys Ala Gly Tyr Glu Ser Gly Thr Val Asp Ser Cys Met Gly 
970 975 980 

gac age ggt ggg cct ctt gtt tgt gag aag cct gga gga egg tgg aca 3090 
Asp Ser Gly Gly Pro Leu Val Cys Glu Lys Pro Gly Gly Arg Trp Thr 
985 990 995 

tta ttt gga tta act tea tgg ggc tec gtc tgc ttt tec aaa gtc ctg 313 8 

Leu Phe Gly Leu Thr Ser Trp Gly Ser Val Cys Phe Ser Lys Val Leu 
1000 1005 1010 1015 



ggg cct ggc gtt tat agt aat gtg tea tat ttc gtc gaa tgg att aaa 



3186 
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Gly Pro Gly Val Tyr Ser Asn Val Ser Tyr Phe Val Glu Trp He Lys 

1020 1025 1030 

aga cag att tac ate cag acc ttt etc eta aac taa ttataaggat 3232 
Arg Gin He Tyr He Gin Thr Phe Leu Leu Asn * 

1035 1040 

gatcagagac ttttgecage tacactaaaa gaaaatggcc ttcttgactg tgaagagctg 3292 

ectgeagaga gctgtacaga agcacttttc atggacagaa atgetcaate gtgeactgea 3352 

aatttgcatg tttgttttgg actaattttt ttcaatttat tttttcacct tcatttttct 3412 

cttatttcaa gttcaatgaa agactttaca aaagcaaaca aagcagactt tgtccttttg 3472 

ccaggcctaa ccatgactgc agcacaaaat tatcgactct ggcgagattt aaaatcaggt 3532 

gctacagtaa caggttatgg aatggtctct tttatcctat cacaaaaaaa gacatagata 3592 

tttaggctga ttaattatct ctaccagttt ttgtttctca agctcagtgc atagtggtaa 3652 

atttcagtgt taacattgga gaettgettt tctttttctt tttttatacc ccacaattct 3712 

tttttattac acttcgaatt ttagggtaca cgagcacaac gtgcaggtta gttacatatg 3772 

tatacatgtg ccatgttggt gtgetgaace cagtaactcg tcatttgatt tattaaaagc 3832 

caagataatt tacatgttta aagtatttac tattaccccc ttctaatgtt tgeataatte 3892 

tgagaactga taaaagacag caataaaaga ccagtgtcat ccatttaggt agcaagacat 3952 

attgaatgea aagttcttta gatatcaata ttaacacttg acattattgg accccccatt 4012 

ctggatgtat atcaagatca taattttata gaagagtctc tatagaactg tcctcatagc 4072 

tgggtttgtt caggatatat gagttggctg attgagactg caacaactac atctatattt 4132 

atgggcaata ttttgtttta cttatgtggc aaagaactgg atattaaact ttgeaaaaga 4192 

gaatttagat gagagatgea attttttaaa aagaaaatta atttgeatec ctcgtttaat 4252 

taaatttatt tttcagtttt ettgegttea tccataccaa caaagtcata aagagcatat 4312 

tttagagcac agtaagactt tgcatggagt aaaacatttt gtaattttcc tcaaaagatg 4372 

tttaatatct ggtttcttct cattggtaat taaaatttta gaaatgattt ttagctctag 4432 

gccactttac gcaactcaat ttctgaagca attagtggta aaaagtattt ttccccacta 4492 

aaaaacttta aaacacaaat cttcatatat acttaattta attagtcagg catccatttt 4552 

gecttttaaa caactaggat tccctactaa cctccaccag caacctggac tgcctcagca 4612 

ttccaaatag atactacctg caattttata catgtatttt tgtatctttt ctgtgtgtaa 4672 

acatagttga aattcaaaaa gttgtagcaa tttctatact attcatctcc tgtccttcag 4732 

tttgtataaa cctaaggaga gtgtgaaatc cagcaactga attgtggtca cgattgtatg 4792 

aaagttcaag aacatatgtc agttttgtta cagttgtagc tacatactca atgtatcaac 4852 

ttttagcctg ctcaacttag gctcagtgaa atatatatat tatacttatt ttaaataatt 4912 

cttaatacaa ataaaatggt a 4933 

<210> 62 

<211> 1042 

<212> PRT 

<213> Homo Sapien 

<400> 62 

Met Lys Gin Ser Pro Ala Leu Ala Pro Glu Glu Arg Tyr Arg Arg Ala 

15 10 15 

Gly Ser Pro Lys Pro Val Leu Arg Ala Asp Asp Asn Asn Met Gly Asn 

20 25 30 

Gly Cys Ser Gin Lys Leu Ala Thr Ala Asn Leu Leu Arg Phe Leu Leu 

35 40 45 

Leu Val Leu He Pro Cys He Cys Ala Leu Val Leu Leu Leu Val He 

50 55 60 

Leu Leu Ser Tyr Val Gly Thr Leu Gin Lys Val Tyr Phe Lys Ser Asn 
65 70 75 80 

Gly Ser Glu Pro Leu Val Thr Asp Gly Glu He Gin Gly Ser Asp Val 

85 90 95 

He Leu Thr Asn Thr He Tyr Asn Gin Ser Thr Val Val Ser Thr Ala 

100 105 110 

His Pro Asp Gin His Val Pro Ala Trp Thr Thr Asp Ala Ser Leu Pro 

115 120 125 

Gly Asp Gin Ser Hie Arg Asn Thr Ser Ala Cys Met Asn He Thr His 

130 135 140 

Ser Gin Cys Gin Met Leu Pro Tyr His Ala Thr Leu Thr Pro Leu Leu 
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145 

Ser Val Val Arg 

Tyr Leu His Arg 

180 

Thr Leu Ala Phe 
195 

Leu Leu Pro Cys 
210 

Ser Val Leu Gly 
225 

Ser Gin Phe Arg 

Phe Ser Pro Gin 

260 

Glu Asn Phe Leu 
275 

Cys Asn Gly Tyr 
290 

Asn Cys Ser Glu 
305 

Tyr Ser Leu Val 

Glu Gin Asn Cys 

340 

Gly Arg Cys lie 
355 

Val Asp Lys Ser 
370 

Val Glu Cys Arg 
385 

Gly Asp Glu Asp 

lie Gin Thr Ser 

420 

Cys Leu Asp Ser 
435 

Leu Asn Asn Cys 
450 

Asn Leu Pro Tyr 
465 

Thr Gin Lys Glu 

Leu Val Gin Thr 

500 

lie Leu Val Pro 
515 

Cys -Arg Ala Leu 
530 

Gly lie Val Gly 
545 

Pro Glu Glu Asn 

Val Glu Glu Cys 

580 

Val Leu Ala Ser 
595 

Ser Asp Glu Glu 
610 

Pro Ser Asn Lys 
625 

Pro Asp Cys Pro 



150 

Asn Met Glu Met 
165 

Leu Ser Cys Tyr 

Pro Glu Cys lie 

200 

Arg Ser Phe Cys 
215 

Met Val Asn Tyr 
230 

Asn Gin Thr Glu 
245 

Gin Glu Asn Gly 

Cys Ala Ser Gly 

280 

Asn Asp Cys Asp 
295 

Asn Leu Phe His 
310 

Cys Asp Gly Tyr 
325 

Asp Cys Asn Pro 

Ala Met Glu Trp 

360 

Asp Glu Val Asn 
375 

Asn Gly Gin Cys 
390 

Cys Lys Asp Gly 
405 

Cys Gin Glu Gly 

Cys Gly Gly Ser 

440 

Ser Gin Cys Glu 
455 

Asn Ser Thr Ser 
470 

Ala Ser lie Ser 
485 

Asn Cys Tyr Lys 

Lys Cys Asp Val 

520 

Cys Glu His Ser 
535 

Leu Gin Trp Pro 
550 

Ser Asp Asn Gin 
565 

Ser Pro Ser His 

Arg Arg Cys Asp 

600 

Asn Cys Gly Cys 
615 

Gin Cys Leu Lys 
630 

Asp Tyr Met Asp 



155 

Glu Lys Phe Leu 
170 

Gin His He Met 
185 

He Asp Gly Asp 

Glu Ala Ala Lys 

220 

Ser Trp Pro Asp 
235 

Ser Ser Asn Val 
250 

Lys Gin Leu Leu 
265 

He Cys He Pro 

Asp Trp Ser Asp 

300 

Cys His Thr Gly 
315 

Asp Asp Cys Gly 
330 

Thr Thr Glu His 
345 

Val Cys Asp Gly 

Cys Ser Cys His 

380 

He Pro Ser Thr 
395 

Ser Asp Glu Glu 
410 

Asp Gin Arg Cys 
425 

Ser Leu Cys Asp 

Pro He Thr Leu 

460 

Tyr Pro Asn Tyr 
475 

Trp Glu Ser Ser 
490 

Tyr Leu Met Phe 
505 

Asn Thr Gly Glu 

Lys Glu Arg Cys 

540 

Glu Asp Thr Asp 
555 

Thr Cys Leu Met 
570 

Phe Lys Cys Arg 
585 

Gly Gin Ala Asp 

Lys Glu Arg Asp 

620 

His Thr Val He 
635 

Glu Lys Asn Cys 



160 

Lys Phe Phe Thr 
175 

Leu Phe Gly Cys 
190 

Asp Ser His Gly 
205 

Glu Gly Cys Glu 

Phe Leu Arg Cys 

240 

Ser Arg He Cys 
255 

Cys Gly Arg Gly 
270 

Gly Lys Leu Gin 
285 

Glu Ala His Cys 

Lys Cys Leu Asn 

320 

Asp Leu Ser Asp 
335 

Arg Cys Gly Asp 
350 

Asp His Asp Cys 
365 

Ser Gin Gly Leu 

Phe Gin Cys Asp 

400 

Asn Cys Ser Val 
415 

Leu Tyr Asn Pro 
430 

Pro Asn Asn Ser 
445 

Glu Leu Cys Met 

Phe Gly His Arg 

480 

Leu Phe Pro Ala 
495 

Phe Ser Cys Thr 
510 

Arg He Pro Pro 
525 

Glu Ser Val Leu 

Cys Ser Gin Phe 

560 

Pro Asp Glu Tyr 
575 

Ser Gly Gin Cys 
590 

Cys Asp Asp Asp 
605 

Leu Trp Glu Cys 

Cys Asp Gly Phe 

640 

Ser Phe Cys Gin 
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645 650 655 



Asp 


Asp 


Glu 


Leu 


Glu 


Cys 


Ala 


Asn 


His 


Ala Cys Val Ser 


Arg 


Asp 


Leu 








660 










665 




670 








Cvs 




Gly 


Glu Ala Asp 


Cys 


Ser 


Asp Ser Ser Asp 


Glu 


Trp 


Asp 






675 










680 




685 








Cvs 


Val 


Thr 


Leu 


Ser 


He 


Asn 


Val 


Asn 


Ser Ser Ser Phe 


Leu 


Met 


Val 


690 










695 






700 








His 




Ala 


Ala 


Thr 


Glu 


His 


His 


Val 


Cys Ala Asp Gly 


Trp 


Gin 


Glu 


705 










710 








715 






720 


lie 


Leu 


Ser 


Gin 


Leu 


Ala 


Cys 


Lye 


Gin 


Met Gly Leu Gly 


Glu 


Pro 


Ser 










725 










730 




735 




Val 


Thr 


Lva 


Leu 


He 


Gin 


Glu 


Gin 


Glu 


Lys Glu Pro Arg 


Trp 


Leu 


Thr 








740 










745 




750 






Lieu 


His 


Ser 


Asn 


Trp 


Glu 


Ser 


Leu 


Asn 


Gly Thr Thr Leu His 


Glu 


Leu 






755 










760 




765 








T.pii 


Val 




Glv 


Gin 


Ser 


Cys 


Glu 


Ser 


Arg Ser Lys He 


Ser 


Leu 


Leu 




770 










775 






780 










Thr 




Gin 


Asp 


Cys 


Gly Arg Arg 


Pro Ala Ala Arg Met 


As Tl 




785 










790 








795 






800 






uc u 


Glv 


Gly Arg Thr 


Ser 


Arg 


Pro Gly Arg Trp 


Pro 


Trn 
±J -ir > 


Gin 










805 










810 




815 

\f -—J 








Leu 


Gin 


Ser 


Glu 


Pro 


Ser 


Gly 


His He Cys Gly 


Cys 


Val 


XrJ S»> \_X 








820 










825 




830 






He 


Ala 


Lys 


Lys 


Trp 


Val 


Leu 


Thr 


Val 


Ala His Cys Phe 


Glu 










835 










840 




845 








Glu 


Asn 


Ala 


Ala 


Val 


Trp 


Lys 


Val 


Val 


Leu Gly He Asn 


Asn 


lien 


Asn 




850 










855 






860 








Hie 


Pro 


Ser 


Val 


Phe 


Met 


Gin 


Thr 


Arg 


Phe Val Lys Thr 


He 


He 

-A- -A- V- 


Leti 


865 










870 








875 






880 


His 


Pro 


Arg 


Tyr 


Ser 


Arg 


Ala 


Val 


Val 


Asp Tyr Asp He 


Ser 


He 


Val 










885 










890 




895 




Glu 


Leu 


Ser 


Glu 


Asp 


He 


Ser 


Glu 


Thr 


Gly Tyr Val Arg 


Pro 


Val 


Cvs 








900 










905 




910 






Leu 


Pro 


Asn 


Pro 


Glu 


Gin 


Trp 


Leu 


Glu 


Pro Asp Thr Tyr 


Cys 


Tvr 

j. jr j_ 


He 






915 










920 




925 








Thr 


Gly 


Trp Gly 


His 


Met 


Gly Asn 


Lys 


Met Pro Phe Lys 


Leu 


Gin 


Glu 




930 










935 






940 








Gly Glu 


Val 


Arg 


He 


He 


Ser 


Leu 


Glu 


His Cys Gin Ser 


Tyr 


Phe 


Asp 


945 










950 








955 






960 


Met 


Lys 


Thr 


He 


Thr 


Thr 


Arg 


Met 


He 


Cys Ala Gly Tyr 


Glu 


Ser 


Gly 










965 










970 




975 




Thr 


Val 


Asp 


Ser 


Cys 


Met 


Gly Asp 


Ser 


Gly Gly Pro Leu 


Val 


Cys 


Glu 








980 










985 




990 






Lys 


Pro 


Gly Gly 


Arg 


Trp 


Thr 


Leu 


Phe 


Gly Leu Thr Ser 


Trp 


Gly 


Ser 






995 










1000 


1005 






Val 


Cys 


Phe 


Ser 


Lys 


Val 


Leu 


Gly 


Pro 


Gly Val Tyr Ser 


Asn 


Val 


Ser 




1010 








1015 




1020 








Tyr 


Phe 


Val 


Glu 


Trp 


He 


Lys 


Arg 


Gin 


He Tyr He Gin 


Thr 


Phe 


Leu 


1025 








1030 






1035 






1040 


Leu 


Asn 
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<212> DNA 

<213> Homo Sapien 

<220> 
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<223> Nucleotide sequence encoding human en tor kinase 
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<300> 

<308> GenBank HSU09860 
<309> 1995-06-03 

<400> 63 

accagacagt tcttaaatta gcaagccttc aaaaccaaaa atg ggg teg aaa aga 

Met Gly Ser Iiys Arg 
1 5 



get ttt gac ctt cag caa atg ata gat gag ate ttt eta tea age aat 
Ala Phe Asp Leu Gin Gin Met He Asp Glu He Phe Leu Ser Ser Asn 

90 95 100 

ctg aag aat gaa tat aag aac tea aga gtt tta caa ttt gaa aat ggc 
Leu Lys Asn Glu Tyr Lys Asn Ser Arg Val Leu Gin Phe Glu Asn Gly 

105 110 115 

age att ata gtc gta ttt gac ctt ttc ttt gec cag tgg gtg tea gat 
Ser He He Val Val Phe Asp Leu Phe Phe Ala Gin Trp Val Ser Asp 
120 125 130 

caa aat gta aaa gaa gaa ctg att caa ggc ctt gaa gca aat aaa tec 
Gin Asn Val Lys Glu Glu Leu He Gin Gly Leu Glu Ala Asn Lys Ser 
135 140 145 



55 



ggc ata tct tct agg cat cat tct etc age tec tat gaa ate atg ttt 103 
Gly He Ser Ser Arg His His Ser Leu Ser Ser Tyr Glu He Met Phe 

10 15 20 

gca get etc ttt gee ata ttg gta gtg etc tgt get gga tta att gca 151 
Ala Ala Leu Phe Ala He Leu Val Val Leu Cys Ala Gly Leu He Ala 

25 30 35 

gta tec tgc ctg aca ate aag gaa tec caa cga ggt gca gca ctt gga 199 
Val Ser Cys Leu Thr He Lys Glu Ser Gin Arg Gly Ala Ala Leu Gly 
40 45 50 

cag agt cat gaa gee aga gcg aca ttt aaa ata aca tec gga gtt aca 247 
Gin Ser His Glu Ala Arg Ala Thr Phe Lys He Thr Ser Gly Val Thr 
55 60 65 

tat aat cct aat ttg caa gac aaa etc tea gtg gat ttc aaa gtt ctt 295 
Tyr Asn Pro Asn Leu Gin Asp Lys Leu Ser Val Asp Phe Lys Val Leu 
70 75 80 85 



343 



391 



439 



487 



age caa ctg gtc act ttc cat att gat ttg aac age gtt gat ate eta 535 

Ser Gin Leu Val Thr Phe His He Asp Leu Asn Ser Val Asp He Leu 

150 155 160 165 

gac aag eta aca acc acc agt cat ctg gca act cca gga aat gtc tea 583 

Asp Lys Leu Thr Thr Thr Ser His Leu Ala Thr Pro Gly Asn Val Ser 

170 175 180 

ata gag tgc ctg cct ggt tea agt cct tgt act gat get eta acg tgt 631 

He Glu Cys Leu Pro Gly Ser Ser Pro Cys Thr Asp Ala Leu Thr Cys 

185 190 195 

ata aaa get gat tta ttt tgt gat gga gaa gta aac tgt cca gat ggt 679 

He Lys Ala Asp Leu Phe Cys Asp Gly Glu Val Asn Cys Pro Asp Gly 
200 205 210 



tct gac gaa gac aat aaa atg tgt gee aca gtt tgt gat gga aga ttt 



727 
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Ser Asp Glu Asp Asn Lys Met Cys Ala Thr Val Cys Asp Gly Arg Phe 
215 220 225 

ttg tta act gga tea tct ggg tct ttc cag get act cat tat cca aaa 775 
Leu Leu Thr Gly Ser Ser Gly Ser Phe Gin Ala Thr His Tyr Pro Lys 
230 235 240 245 

cct tct gaa aca agt gtt gtc tgc cag tgg ate ata cgt gta aac caa 823 
Pro Ser Glu Thr Ser Val Val Cys Gin Trp lie lie Arg Val Asn Gin 

250 255 260 

gga ctt tec att aaa ctg age ttc gat gat ttt aat aca tat tat aca 871 
Gly Leu Ser lie Lys Leu Ser Phe Asp Asp Phe Asn Thr Tyr Tyr Thr 

265 270 275 

gat ata tta gat att tat gaa ggt gta gga tea age aag att tta aga 919 
Asp lie Leu Asp lie Tyr Glu Gly Val Gly Ser Ser Lys lie Leu Arg 
280 285 290 

get tct att tgg gaa act aat cct ggc aca ata aga att ttt tec aac 967 
Ala Ser lie Trp Glu Thr Asn Pro Gly Thr lie Arg lie Phe Ser Asn 
295 300 305 

caa gtt act gee ace ttt ctt ata gaa tct gat gaa agt gat tat gtt 1015 
Gin Val Thr Ala Thr Phe Leu lie Glu Ser Asp Glu Ser Asp Tyr Val 
310 315 320 325 

ggc ttt aat gca aca tat act gca ttt aac age agt gag ctt aat aat 1063 
Gly Phe Asn Ala Thr Tyr Thr Ala Phe Asn Ser Ser Glu Leu Asn Asn 

330 335 340 

tat gag aaa att aat tgt aac ttt gag gat ggc ttt tgt ttc tgg gtc 1111 
Tyr Glu Lys lie Asn Cys Asn Phe Glu Asp Gly Phe Cys Phe Trp Val 

345 350 355 

cag gat eta aat gat gat aat gaa tgg gaa agg att cag gga age ace 1159 
Gin Asp Leu Asn Asp Asp Asn Glu Trp Glu Arg lie Gin Gly Ser Thr 
360 365 370 

ttt tct cct ttt act gga ccc aat ttt gac cac act ttt ggc aat get 1207 
Phe Ser Pro Phe Thr Gly Pro Asn Phe Asp His Thr Phe Gly Asn Ala 
375 380 385 

tea gga ttt tac att tct ace cca act gga cca gga ggg aga caa gaa 1255 
Ser Gly Phe Tyr lie Ser Thr Pro Thr Gly Pro Gly Gly Arg Gin Glu 
390 395 400 405 



cga gtg ggg ctt tta age etc cct ttg gac ccc act ttg gag cca get 
Arg Val Gly Leu Leu Ser Leu Pro Leu Asp Pro Thr Leu Glu Pro Ala 

410 415 420 



age att aat ate age aat gac caa aat atg gag aag aca gtt ttc caa 
Ser lie Asn lie Ser Asn Asp Gin Asn Met Glu Lys Thr Val Phe Gin 
440 445 450 

aag gaa gga aat tat gga gac aat tgg aat tat gga caa gta ace eta 
Lys Glu Gly Asn Tyr Gly Asp Asn Trp Asn Tyr Gly Gin Val Thr Leu 
455 460 465 



1303 



tgc ctt agt ttc tgg tat cat atg tat ggt gaa aat gtc cat aaa tta 1351 
Cys Leu Ser Phe Trp Tyr His Met Tyr Gly Glu Asn Val His Lys Leu 

425 430 435 



1399 



1447 
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aat gaa aca 
Asn Glu Thr 
470 

ate ctg agt 
lie Leu Ser 



tgc aat ggg 
Cys Asn Gly 



cca gaa ctt 
Pro Glu Leu 
520 

aat aca aca 
Asn Thr Thr 
535 

get ttc tgt 
Ala Phe Cys 
550 

ctt cat ttt 
Leu His Phe 



at a aga gat 
lie Arg Asp 



ggg cct ggc 
Gly Pro Gly 
600 

gtg ctt etc 
Val Leu Leu 
615 

aac ttt act 
Asn Phe Thr 
630 

gac cat ttt 
Asp His Phe 



tgt gac ggt 
Cys Asp Gly 



gtg cgt ttt 
Val Arg Phe 
680 

aga ate cag 
Arg lie Gin 
695 

cag att tea 



gtt aaa ttt 
Val Lys Phe 
475 

gat att gcg 
Asp lie Ala 
490 

agt ctt tat 
Ser Leu Tyr 
505 

cct acg gac 
Pro Thr Asp 



ttc agt tct 
Phe Ser Ser 



gtt tgg att 
Val Trp lie 
555 

caa gaa ttt 
Gin Glu Phe 
570 

ggt gaa gaa 
Gly Glu Glu 
585 

cca gta aag 
Pro Val Lys 



ate act aac 
lie Thr Asn 



act ggc tat 
Thr Gly Tyr 
635 

caa tgt aaa 
Gin Cys Lys 
650 

cat ctg cac 
His Leu His 
665 

ttc aat ggc 
Phe Asn Gly 



age ata tgg 
Ser lie Trp 



aat gat gtt 



aag gtt get 
Lys Val Ala 



ttg gat gac 
Leu Asp Asp 



cca gaa cca 
Pro Glu Pro 
510 

tgt gga gga 
Cys Gly Gly 
525 

acg aac ttt 
Thr Asn Phe 
540 

tta aat gca 
Leu Asn Ala 



gac tta gaa 
Asp Leu Glu 



get gat tec 
Ala Asp Ser 
590 

gat gtg ttc 
Asp Val Phe 
605 

gat gtg ttg 
Asp Val Leu 
620 

cac ttg ggg 
His Leu Gly 



aat gga gag 
Asn Gly Glu 



tgt gag gat 
Cys Glu Asp 
670 

aca acg aac 
Thr Thr Asn 
685 

cat aca get 
His Thr Ala 
700 

tgt caa ctg 



ttt aat get 
Phe Asn Ala 
480 

att age eta 
lie Ser Leu 
495 

act ttg gtg 
Thr Leu Val 



cct ttt gag 
Pro Phe Glu 



cca aac age 
Pro Asn Ser 
545 

caa aaa gga 
Gin Lys Gly 
560 

aat att aac 
Asn lie Asn 
575 

ttg etc tta 
Leu Leu Leu 



tct ace ace 
Ser Thr Thr 



gca aga. gga 
Ala Arg Gly 
625 

att cca gag 
lie Pro Glu 
640 

tgt gtt cca 
Cys Val Pro 
655 

ggc tea gat 
Gly Ser Asp 



aac aat ggt 
Asn Asn Gly 



tgt get gag 
Cys Ala Glu 
705 

ctg gga eta 



ttt aaa aac 
Phe Lys Asn 



aca tat ggg 
Thr Tyr Gly 
500 

cca act cct 
Pro Thr Pro 
515 

ctg tgg gag 
Leu Trp Glu 
530 

tac cct aat 
Tyr Pro Asn 



aag aat ata 
Lys Asn lie 



gat gta gtt 
Asp Val Val 
580 

get gtg tac 
Ala Val Tyr 
595 

aac aga atg 
Asn Arg Met 
610 

ggg ttt aaa 
Gly Phe Lys 



cca tgc aag 
Pro Cys Lys 



ctg gtg aat 
Leu Val Asn 
660 

gaa gca gat 
Glu Ala Asp 
675 

tta gtg egg 
Leu Val Arg 
690 

aac tgg ace 
Asn Trp Thr 



ggg agt gga 



aag 1495 

Lys 

485 

att 1543 
He 



cca 1591 
Pro 



cca 1639 
Pro 



ctg 1687 
Leu 



caa 1735 

Gin 

565 

gaa 1783 
Glu 



aca 1831 
Thr 



act 1879 
Thr 



gca 1927 
Ala 



gca 1975 

Ala 

645 

etc 2023 
Leu 

tgt * 2071 
Cys 



ttc 2119 
Phe 



acc 2167 
Thr 



aac 2215 
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Gin lie Ser Asn Asp Val Cys Gin Leu Leu Gly Leu Gly Ser Gly Asn 
710 715 720 725 

tea tea aag cca ate ttc tct ace gat ggt gga cca ttt gtc aaa tta 2263 
Ser Ser Lys Pro lie Phe Ser Thr Asp Gly Gly Pro Phe Val Lys Leu 

' 730 735 740 

aac aca gca cct gat ggc cac tta at a eta aca ccc agt caa cag tgt 2311 
Asn Thr Ala Pro Asp Gly His Leu lie Leu Thr Pro Ser Gin Gin Cys 

745 750 755 

tta cag gat tec ttg att egg tta cag tgt aac cat aaa tct tgt gga 2359 
Leu Gin Asp Ser Leu lie Arg Leu Gin Cys Asn His Lys Ser Cys Gly 
760 765 770 

aaa aaa ctg gca get caa gac ate ace cca aag att gtt gga gga agt 2407 
Lys Lys Leu Ala Ala Gin Asp lie Thr Pro Lys lie Val Gly Gly Ser 
775 780 785 

aat gee aaa gaa ggg gee tgg ccc tgg gtt gtg ggt ctg tat tat ggc 2455 
Asn Ala Lys Glu Gly Ala Trp Pro Trp Val Val Gly Leu Tyr Tyr Gly 
790 795 800 805 

ggc cga ctg etc tgc ggc gca tct etc gtc age agt gac tgg ctg gtg 2503 
Gly Arg Leu Leu Cys Gly Ala Ser Leu Val Ser Ser Asp Trp Leu Val 

810 815 . 820 

tec gqc gca cac tgc gtg tat ggg aga aac tta gag cca tec aag tgg 2551 
Ser Ala Ala His Cys Val Tyr Gly Arg Asn Leu Glu Pro Ser Lys Trp 

825 830 835 

aca gca ate eta ggc ctg cat atg aaa tea aat ctg ace tct cct caa 2599 
Thr Ala lie Leu Gly Leu His Met Lys Ser Asn Leu Thr Ser Pro Gin 
840 845 850 

aca gtc cct cga tta ata gat gaa att gtc ata aac cct cat tac aat 2647 
Thr Val Pro Arg Leu lie Asp Glu lie Val lie Asn Pro His Tyr Asn 
855 860 865 

agg cga aga aag gac aac gac att gee atg atg cat ctg gaa ttt aaa 2695 
Arg Arg Arg Lys Asp Asn Asp lie Ala Met Met His Leu Glu Phe Lys 
870 875 880 885 

gtg aat tac aca gat tac ata caa cct att tgt tta ccg gaa gaa aat 2743 
Val Asn Tyr Thr Asp Tyr lie Gin Pro lie Cys Leu Pro Glu Glu Asn 

890 895 900 

caa gtt ttt cct cca gga aga aat tgt tct att get ggt tgg ggg acg 2791 
Gin Val Phe Pro Pro Gly Arg Asn Cys Ser lie Ala Gly Trp Gly Thr 

905 910 915 

gtt gta tat caa ggt act act gca aac ata ttg caa gaa get gat gtt 2839 
Val Val Tyr Gin Gly Thr Thr Ala Asn He Leu Gin Glu Ala Asp Val 
920 925 930 

cct ctt eta tea aat gag aga tgc caa cag cag atg cca gaa tat aac 2887 
Pro Leu Leu Ser Asn Glu Arg Cys Gin Gin Gin Met Pro Glu Tyr Asn 
935 940 945 

att act gaa aat atg ata tgt gca ggc tat gaa gaa gga gga ata gat 2935 
He Thr Glu Asn Met He Cys Ala Gly Tyr Glu Glu Gly Gly He Asp 
950 955 960 965 
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tct tgt cag ggg gat tea gga gga cca tta atg tgc caa gaa aac aac 2983 
Ser Cys Gin Gly Asp Ser Gly Gly Pro Leu Met Cys Gin Glu Asn Asn 

970 975 980 

agg tgg ttc ctt get ggt gtg acc tea ttt gga tac aag tgt gec ctg 3031 
Arg Trp Phe Leu Ala Gly Val Thr Ser Phe Gly Tyr Lys Cys Ala Leu 

985 990 995 

cct aat cgc ccc gga gtg tat gec agg gtc tea agg ttt acc gaa tgg 3079 
Pro Asn Arg Pro Gly Val Tyr Ala Arg Val Ser Arg Phe Thr Glu Trp 
1000 1005 1010 

ata caa agt ttt eta cat tag egcatttett aaactaaaca ggaaagtege 313 0 
lie Gin Ser Phe Leu His * 
1015 

attattttcc cattctactc tagaaagcat ggaaattaag tgtttcgtac aaaaatttta 3190 

aaaagttacc aaaggttttt attcttacct atgtcaatga aatgctaggg ggccagggaa 3250 

acaaaatttt aaaaataata aaattcacca tagcaataca gaataacttt aaaataccat 3310 

taaatacatt tgtatttcat tgtgaacagg tatttcttca cagatctcat ttttaaaatt 3370 

cttaatgatt atttttatta cttactgttg tttaaaggga tgttatttta aagcatatac 343 0 

catacactta agaaatttga gcagaattta aaaaagaaag aaaataaatt gtttttccca 3490 

aagtatgtca ctgttggaaa taaactgeca taaattttct agttccagtt tagtttgctg 3550 

ctattagcag aaactcaatt gtttctctgt cttttctatc aaaattttca acatatgeat 3610 

aaccttagta ttttcccaac caatagaaac tatttattgt aagcttatgt cacaggcctg 367 0 

gactaaattg attttaegtt cctctt 3696 



<210> 64 

<211> 1019 

<212> PRT 

<213> Homo Sapien 



<400> 64 



Met 


Gly 


Ser 


Lys 


1 








Tyr 


Glu 


lie 


Met 








20 


Ala 


Gly 


Leu 


He 






35 




Gly 


Ala 


Ala 


Leu 


50 






Thr 


Ser 


Gly 


Val 


65 








Asp 


Phe 


Lys 


Val 


Phe 


Leu 


Ser 


Ser 








100 


Gin 


Phe 


Glu 


Asn 






115 




Gin 


Trp 


Val 


Ser 




130 






Glu 


Ala 


Asn 


Lys 


145 








Ser 


Val 


Asp 


He 


Pro 


Gly 


Asn 


Val 








180 


Asp 


Ala 


Leu 


Thr 






195 




Asn 


Cys 


Pro 


Asp 




210 







Arg 


Gly 


He 


Ser 


5 








Phe 


Ala 


Ala 


Leu 


Ala 


Val 


Ser 


Cys 








40 


Gly 


Gin 


Ser 


His 






55 




Thr 


Tyr 


Asn 


Pro 




70 






Leu 


Ala 


Phe 


Asp 


85 








Asn 


Leu 


Lys 


Asn 


Gly 


Ser 


lie 


He 






120 


Asp 


Gin 


Asn 


Val 






135 




Ser 


Ser 


Gin 


Leu 




150 






Leu 


Asp 


Lys 


Leu 


165 








Ser 


He 


Glu 


Cys 


Cys 


He 


Lys 


Ala 








200 


Gly 


Ser 


Asp 


Glu 



215 



Ser 


Arg 


His 


His 




10 






Phe 


Ala 


He 


Leu 


25 








Leu 


Thr 


He 


Lys 


Glu 


Ala 


Arg 


Ala 






60 


Asn 


Leu 


Gin 


Asp 






75 




Leu 


Gin 


Gin 


Met 




90 






Glu 


Tyr 


Lys 


Asn 


105 








Val 


Val 


Phe 


Asp 


Lys 


Glu 


Glu 


Leu 






140 


Val 


Thr 


Phe 


His 






155 




Thr 


Thr 


Thr 


Ser 




170 






Leu 


Pro 


Gly 


Ser 


185 








Asp 


Leu 


Phe 


Cys 


Asp 


Asn 


Lys 


Met 








220 



Ser 


Leu 


Ser 


Ser 






15 




Val 


Val 


Leu 


Cys 




30 






Glu 


Ser 


Gin 


Arg 


45 








Thr 


Phe 


Lys 


He 


Lys 


Leu 


Ser 


Val 








80 


He 


Asp 


Glu 


He 






95 




Ser 


Arg 


Val 


Leu 




110 






Leu 


Phe 


Phe 


Ala 


125 








He 


Gin 


Gly 


Leu 


He 


Asp 


Leu 


Asn 








160 


His 


Leu 


Ala 


Thr 






175 




Ser 


Pro 


Cys 


Thr 




190 






Asp 


Gly 


Glu 


Val 


205 








Cys 


Ala 


Thr 


Val 
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wys 


A can 


r:l v 


TV 

Aty 










T>V| •y 


His 


*yr 


Pro 


J. J.C 


ax y 


Val 


TV O Y1 

ash 




*"* 




o c rt 




TVi-r- 


iyr 


iyr 






OTP 

275 




Ser 


Lys 


lie 


Leu 




290 






Arg 


x le 




set 


T A r* 

3 05 








CilU 






Tyr 


t>er 


UrlU 


Leu 


Asn 








340 


pne 


cys 


rfle 


irp 






T f r* 

355 




lie 


Gin 


Gly 


Ser 




370 






ml, v 

THr 


rfle 


Cjly 


Asn 


n ft r 

385 








Gly 


Gly 


Arg 


Gin 


Tin? 


Leu 


GlU 


Pro 








^ o n 


As xi 


Val 


£116 


Lys 






435 




Lys 


Thr 


Val 


Phe 




450 






Gly 


bin 


vai 


inr 


465 








TV "1 _ 


Pne 


LyB 


Asn 


Leu. 


inr 


Tyr 


uiy 








r a a 

bOU 


VaX 
















Glu 


Leu 


Trp 


Glu 




530 






ser 


iyr 


pro 


Asn 










ijys 


Hen 


Tl p 


noil 


7V m 
ASp 


V ctX 


V d-L 








con 




A 1 a 
MXCl 


V CL^ 


j.yx 






595 




Thr 


Asn 


Arg 


Met 




610 






Gly 


Gly 


Phe 


Lys 


625 








Glu 


Pro 


Cys 


Lys 


Pro 


Leu 


Val 


Asn 








660 


Asp 


Glu 


Ala 


Asp 






675 




Gly 


Leu 


Val 


Arg 




690 






Glu 


Asn 


Trp 


Thr 



Phe Leu Leu Thr 
230 

Lys Pro Ser Glu 
245 

Gin Gly Leu Ser 

Thr Asp lie Leu 

280 

Arg Ala Ser lie 
295 

Asn Gin Val Thr 
310 

Val Gly Phe Asn 
325 

Asn Tyr Glu Lys 

Val Gin Asp Leu 

360 

Thr Phe Ser Pro 
375 

Ala Ser Gly Phe 
390 

Glu Arg Val Gly 
405 

Ala Cys Leu Ser 

Leu Ser lie Asn 

440 

Gin Lys Glu Gly 
455 

Leu Asn Glu Thr 
470 

Lys lie Leu Ser 
485 

lie Cys Asn Gly 

Pro Pro Glu Leu 

520 

Pro Asn Thr Thr 
535 

Leu Ala Phe Cys 
550 

Gin Leu His Phe 
565 

Glu lie Arg Asp 

Thr Gly Pro Gly 

600 

Thr Val Leu Leu 
615 

Ala Asn Phe Thr 
630 

Ala Asp His Phe 
645 

Leu Cys Asp Gly 

Cys Val Arg Phe 

680 

Phe Arg lie Gin 
695 

Thr Gin lie Ser 
710 



Gly Ser Ser Gly 
235 

Thr Ser Val Val 
250 

lie Lys Leu Ser 
265 

Asp lie Tyr Glu 

Trp Glu Thr Asn 

300 

Ala Thr Phe Leu 
315 

Ala Thr Tyr Thr 
330 

lie Asn Cys Asn 
345 

Asn Asp Asp Asn 

Phe Thr Gly Pro 

380 

Tyr lie Ser Thr 
395 

Leu Leu Ser Leu 
410 

Phe Trp Tyr His 
425 

lie Ser Asn Asp 

Asn Tyr Gly Asp 

460 

Val Lys Phe Lys 
475 

Asp lie Ala Leu 
490 

Ser Leu Tyr Pro 
505 

Pro Thr Asp Cys 

Phe Ser Ser Thr 

540 

Val Trp lie Leu 
555 

Gin Glu Phe Asp 
570 

Gly Glu Glu Ala 
585 

Pro Val Lys Asp 

lie Thr Asn Asp 

620 

Thr Gly Tyr His 
635 

Gin Cys Lys Asn 
650 

His Leu His Cys 
665 

Phe Asn Gly Thr 

Ser lie Trp His 

700 

Asn Asp Val Cys 
715 



Ser Phe Gin Ala 

240 

Cys Gin Trp lie 
255 

Phe Asp Asp Phe 
270 

Gly Val Gly Ser 
285 

Pro Gly Thr lie 

lie Glu Ser Asp 

320 

Ala Phe Asn Ser 
335 

Phe Glu Asp Gly 
350 

Glu Trp Glu Arg 
365 

Asn Phe Asp His 

Pro Thr Gly Pro 

400 

Pro Leu Asp Pro 
415 

Met Tyr Gly Glu 
430 

Gin Asn Met Glu 
445 

Asn Trp Asn Tyr 

Val Ala Phe Asn 

480 

Asp Asp lie Ser 
495 

Glu Pro Thr Leu 
510 

Gly Gly Pro Phe 
525 

Asn Phe Pro Asn 

Asn Ala Gin Lys 

560 

Leu Glu Asn lie 
575 

Asp Ser Leu Leu 
590 

Val Phe Ser Thr 
605 

Val Leu Ala Arg 

Leu Gly lie Pro 

640 

Gly Glu Cys Val 
655 

Glu Asp Gly Ser 
670 

Thr Asn Asn Asn 
685 

Thr Ala Cys Ala 

Gin Leu Leu Gly 

720 
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T .011 
JUCU 


uiy 


Ser Gly Asn Ser 


Car 

OCX 


T Arc 


~X 


Tip 

11C 


rue 




X 111 


nop 


vriy 


vj-Ly 








725 








Tin 










/ j3 




rlO 


xrXie 


Val Lys 


Leu Asn 


X IXX 


Ala 


£r ±. \J 


nop 


uiy 


1T1 B 


lie u 


lie 


LCU 


X 111 






740 








1 A C. 










T C A 






riO 


O c\ >~ 
OCX 


Gin Gin 


Cys Leu 
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Q^T" 
OCX 


xjcu 


Tip 

11C 


Aiy 
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A£>11 
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jjys 


vjIU 


\aiy 


Aia 
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Val 
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Q 1 f\ 

oiu 
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O A >«• 
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ASp 
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Val Ser 
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£11 S 


Lys 


Val 


xyr 
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ash 


JjcU 






820 
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VjlU 


xTO 


Ser Lys 
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116 


T i 
lieU 
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jjys 


o 
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Ser Pro 
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Val 


Dm 
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Aivy 


T .ot i 


116 


nop 
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Val 
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% ft T^l 
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ASp 


TV **i T~i 
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ASp 
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o oU 


£11 S 
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x 111 
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1 lc 
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Tip 
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o a n 

o y U 










O Q C 
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Glu Glu 
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Pro 


XT X. kj» 


uiy 


A-L7CJ 


Asn 


Cys 




x xe 






900 








905 










910 






Ala 


Gly 


Trp Gly Thr Val' 


Val 


Tyr 


Gin 


Gly 


Thr 


Thr 


Ala 


Asn 


He 


Leu 






915 






920 










925 








Gin 


Glu 


Ala Asp 


Val Pro 


Leu 


Leu 


Ser 


Asn 


Glu 


Arg 


Cys 


Gin 


Gin 


Gin 




930 






935 










940 










Met 


Pro 


Glu Tyr Asn lie 


Thr 


Glu 


Asn 


Met 


lie 


Cys 


Ala 


Gly 


Tyr 


Glu 


945 






950 










955 








960 


Glu 


Gly 


Gly lie 


Asp Ser 


Cys 


Gin 


Gly 


Asp 


Ser 


Gly 


Gly 


Pro 


Leu 


Met 








965 








970 










975 




Cys 


Gin 


Glu Asn Asn Arg 


Trp 


Phe 


Leu 


Ala 


Gly 


Val 


Thr 


Ser 


Phe 


Gly 






980 








985 










990 






Tyr 


Lys 


Cys Ala 


Leu Pro 


Asn 


Arg 


Pro 


Gly 


Val 


Tyr 


Ala 


Arg 


Val 


Ser 






995 






1000 
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Arg 


Phe 


Thr Glu 
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Ser 


Phe 
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<210> 65 

<211> 1500 

<212> DNA 

<213> Homo Sapien 

<220> 
<221> CDS 

<222> (62) . . . (1318) 

<223> Nucleotide sequence encoding human airway 
trypsin-like protease 

<300> 

<308> GenBank AB002134 
<309> 1998-06-04 

<400> 65 

gagtgggaat ctcaaagcag ttgagtaggc agaaaaaaga acctcttcat taaggattaa 60 
a atg tat agg cca gca cgt gta act teg act tea aga ttt ctg aat cca 109 
Met Tyr Arg Pro Ala Arg Val Thr Ser Thr Ser Arg Phe Leu Asn Pro 
15 10 15 

tat gta gta tgt ttc att gtc gtc gca ggg gta gtg ate ctg gca gtc 157 
Tyr Val Val Cys Phe He Val Val Ala Gly Val Val He Leu Ala Val 
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20 25 30 

acc ata get eta ctt gtt tac ttt tta get ttt gat caa aaa tct tac 205 
Thr lie Ala Leu Leu Val Tyr Phe Leu Ala Phe Asp Gin Lys Ser Tyr 
35 40 45 

ttt tat agg age agt ttt caa etc eta aat gtt gaa tat aat agt cag 253 
Phe Tyr Arg Ser Ser Phe Gin Leu Leu Asn Val Glu Tyr Asn Ser Gin 
50 55 60 

tta aat tea cca get aca cag gaa tac agg act ttg agt gga aga att 301 
Leu Asn Ser Pro Ala Thr Gin Glu Tyr Arg Thr Leu Ser Gly Arg lie 
65 70 75 80 

gaa tct ctg att act aaa aca ttc aaa gaa tea aat tta aga aat cag 349 
Glu Ser Leu lie Thr Lys Thr Phe Lys Glu Ser Asn Leu Arg Asn Gin 

85 90 95 

ttc ate aga get cat gtt gec aaa ctg agg caa gat ggt agt ggt gtg 397 
Phe lie Arg Ala His Val Ala Lys Leu Arg Gin Asp Gly Ser Gly Val 

100 105 110 

aga gcg gat gtt gtc atg aaa ttt caa ttc act aga aat aac aat gga 445 
Arg Ala Asp Val Val Met Lys Phe Gin Phe Thr Arg Asn Asn Asn Gly 
115 120 125 

gca tea atg aaa age aga att gag tct gtt tta cga caa atg ctg aat 493 
Ala Ser Met Lys Ser Arg lie Glu Ser Val Leu Arg Gin Met Leu Asn 
130 135 140 

aac tct gga aac ctg gaa ata aac cct tea act gag ata aca tea ctt 541 
Asn Ser Gly Asn Leu Glu lie Asn Pro Ser Thr Glu lie Thr Ser Leu 
145 150 155 160 

act gac cag get gca gca aat tgg ctt att aat gaa tgt ggg gee ggt 589 
Thr Asp Gin Ala Ala Ala Asn Trp Leu lie Asn Glu Cys Gly Ala Gly 

165 170 175 

cca gac eta ata aca ttg tct gag cag aga ate ctt gga ggc act gag 637 
Pro Asp Leu lie Thr Leu Ser Glu Gin Arg lie Leu Gly Gly Thr Glu 

180 185 190 

get gag gag gga age tgg ccg tgg caa gtc agt ctg egg etc aat aat 685 
Ala Glu Glu Gly Ser Trp Pro Trp Gin Val Ser Leu. Arg Leu Asn Asn 
195 200 205 

gec cac cac tgt gga ggc age ctg ate aat aac atg tgg ate ctg aca 733 
Ala His His Cys Gly Gly Ser Leu lie Asn Asn Met Trp lie Leu Thr 
210 215 220 

gca get cac tgc ttc aga age aac tct aat cct cgt gac tgg att gee 781 
Ala Ala His Cys Phe Arg Ser Asn Ser Asn Pro Arg Asp Trp lie Ala 
225 230 235 240 

acg tct ggt att tec aca aca ttt cct aaa eta aga atg aga gta aga 829 
Thr Ser Gly lie Ser Thr Thr Phe Pro Lys Leu Arg Met Arg Val Arg 

245 250 255 

aat att tta att cat aac aat tat aaa tct gca act cat gaa aat gac 877 
Asn lie Leu lie His Asn Asn Tyr Lys Ser Ala Thr His Glu Asn Asp 

260 265 270 
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att gca ctt gtg aga ctt gag aac agt gtc acc ttt acc^aaa gat ate 925 
lie Ala Leu Val Arg Leu Glu Asn Ser Val Thr Phe Thr Lys Asp lie 
275 280 285 

cat agt gtg tgt etc cca get get acc cag aat att cca cct ggc tct 973 
His Ser Val Cys Leu Pro Ala Ala Thr Gin Asn lie Pro Pro Gly Ser 
290 295 300 

act get tat gta aca gga tgg ggc get caa gaa tat get ggc cac aca 1021 
Thr Ala Tyr Val Thr Gly Trp Gly Ala Gin Glu Tyr Ala Gly His Thr 
305 310 315 320 

gtt cca gag eta agg caa gga cag gtc aga ata ata agt aat gat gta 1069 
Val Pro Glu Leu Arg Gin Gly Gin Val Arg lie lie Ser Asn Asp Val 

325 330 335 

tgt aat gca cca cat agt tat aat gga gee ate ttg tct gga atg ctg 1117 
Cys Asn Ala Pro His Ser Tyr Asn Gly Ala lie Leu Ser Gly Met Leu 

340 345 350 

tgt get gga gta cct caa ggt gga gtg gac gca tgt cag ggt gac tct 1165 
Cys Ala Gly Val Pro Gin Gly Gly Val Asp Ala Cys Gin Gly Asp Ser 
355 360 365 

ggt ggc cca eta gta caa gaa gac tea egg egg ctt tgg ttt att gtg 1213 
Gly Gly Pro Leu Val Gin Glu Asp Ser Arg Arg Leu Trp Phe He Val 
370 375 380 

ggg ata gta age tgg gga gat cag tgt ggc ctg ccg gat aag cca gga 1261 
Gly He Val Ser Trp Gly Asp Gin Cys Gly Leu Pro Asp Lys Pro Gly 
385 . 390 395 400 

gtg tat act cga gtg aca gee tac ctt gac tgg att agg caa caa act 13 09 

Val Tyr Thr Arg Val Thr Ala Tyr Leu Asp Trp He Arg Gin Gin Thr 

405 410 415 

ggg ate tag tgcaacaagt gcatccctgt tgeaaagtet gtatgcaggt 1358 
Gly He * 

gtgcctgtct taaattccaa agctttacat ttcaactgaa aaagaaacta gaaatgtcct 1418 

aatttaacat cttgttacat aaatatggtt taacaaacac tgtttaacct ttctttatta 1478 

ttaaaggttt tctattttct cc 1500 

<210> 66 

<211> 418 

<212> PRT 

<213> Homo Sapien 



<400> 66 












Met Tyr Arg 


Pro 


Ala 


Arg 


Val 


Thr 


1 




5 








Tyr Val Val 


Cys 
20 


Phe 


He 


Val 


Val 


Thr He Ala 


Leu 


Leu 


Val 


Tyr 


Phe 


35 










40 


Phe Tyr Arg 


Ser 


Ser 


Phe 


Gin 


Leu 


50 








55 




Leu Asn Ser 


Pro 


Ala 


Thr 


Gin 


Glu 


65 






70 






Glu Ser Leu 


He 


Thr 


Lys 


Thr 


Phe 



85 



Ser 


Thr 


Ser 


Arg 


Phe 


Leu 


Asn 


Pro 




10 










15 




Ala 


Gly Val 


Val 


He 


Leu 


Ala 


Vaa 


25 










30 






Leu 


Ala 


Phe 


Asp 


Gin 


Lys 


Ser 


Tyr 










45 








Leu 


Asn 


Val 


Glu 


Tyr 


Asn 


Ser 


Gin 








60 










Tyr 


Arg 


Thr 


Leu 


Ser 


Gly 


Arg 


He 






75 










80 


Lys 


Glu 


Ser 


Asn 


Leu 


Arg 


Asn 


Gin 


90 










95 
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rue 


lie 

X X 




Ala 

too 


His 


Val 


Ala 


Lvs 


Leu 
105 

^ W *mmf 


Arcr 


Gin 

1* 


As 13 


Glv 

V3X 


Ser 
110 


Glv 


Val 


Aver 


Ala 




Val 


Val 


.Met 


Lvs 


Phe 


Gin 


Phe 


Thr 


Aro 


Asn 


Asn 


Asn 


Glv 




TIC? 




















X 








Ala 


Ser 


Met 


Lys 


Ser 


Arg 


He 


Glu 


Ser 


Val 


Leu 


Arg 


Gin 


Met 


Leu 


Asn 




Ijw 








x j 3 










X*3 V 










7\ c? n 


OCX 


VJly 


nsil 


Leu 


Glu 


Tip 


Asn 




iJCl 


X 


Glu 


lie 

X X ^— 




±J 4^ 


Leu 

^^^^ 


T A C 








150 










133 










X O V/ 


inr 


nap 


Gl n 


7\ 1 o 
nla 


Ala 


Ala 


an 
risll 




IJC IX 


XX c 


na XX 


Glu 




Gl v 
wX y 


Ala 

nx GL 


Glv 








165 










1 / V 














Pro 


Asp 


Leu 


He 


Thr 


Leu 


Ser 


Glu 


Gin 


Arg 


He 


Leu 


Gly 


Gly 


Thr 


Glu 






i on 

lou 










i ft 

1D3 










127 v 






nia 


Gl n 


Gl ii 

iyo 


ri~\ \ r 


Ser 


Trp 


Prn 


*? n n 


Gin 


Val 

V CI X 


OCX 


LX 


2Vt*0 


XJw U 






nil => 

Ala 


ill 5 


T_T-I o 

nib 


v-y => 


Gly Gly 


Ser 


Leu 

■ 


lie 


T\ on 


ASH 




xxp 


Tie 

X xc 




Thr 

^ ill 














215 










*5 *5 n 










Ala 


AJ.SL 


TT J f-. 


t*ys 


Phe 


Arg 


Ser 


Asn 


Ser 


7V en 
/ioQ 


trJL C 






xxp 


X xc 


Al n 

oX CX 


*3 O C 










230 




















O/in 
*± w 


Thr 


o ex 


vjiy 


Tip 
116 


Ser 
245 


Thr 


Thr 


Phe 


Pro 


j£ 3 (J 




j*i-x y 


Met 


nx y 


Val 

V c^x 
O 


At*ct 

«^x ~J 


Asn 


11c 


lieu 


lie 


His 


Asn 


Asn 


Tyr 


Lys 


Qc*t~ 
O CX 


nla 


x xxx 


n .in s 


Gl ii 

uX LX 


Aon 
x%o XX 


A<3T"i 
















265 


















Ala 


JJCU 


v cxi 


Arg 


Leu 


Glu 


Asn 


Ser 


v cii 






Thr 

X XIX 


T,vr 


&erk 


He 

X xc 






2/5 








280 










ZDS 








His 

XXX O 


O C X 

290 


Val 


Gvs 


Leu 


Pro 


Ala 
295 


Ala 


Thr 


Gin 


Asn 


He 
300 


Pro 


Pro 


Glv 


Ser 


Thr 


Ala 


Tyr 


Val 


Thr 


Gly 


Trp 


Gly 


Ala 


Gin 


Glu 


Tyr 


Ala 


Gly 


His 


Thr 


305 










310 










315 










320 


Val 


Pro 


Glu 


Leu 


Arg 


Gin 


Gly Gin Val 


Arg 


He 


He 


Ser 


Asn 


Asp 


Val 










325 










330 










335 




Cys 


Asn 


Ala 


Pro 


His 


Ser 


Tyr 


Asn 


Gly 


Ala 


He 


Leu 


Ser 


Gly 


Met 


Leu 






340 










345 










350 






Cys 


Ala 


Gly 


Val 


Pro 


Gin 


Gly Gly Val 


Asp 


Ala 


Cys 


Gin 


Gly 


Asp 


Ser 




355 










360 










365 








Gly 


Gly 


Pro 


Leu 


Val 


Gin 


Glu 


Asp 


Ser 


Arg 


Arg 


Leu 


Trp 


Phe 


He 


Val 


370 










375 










380 










Gly 


He 


Val 


Ser 


Trp 


Gly 


Asp 


Gin 


Cys 


Gly 


Leu 


Pro 


Asp 


Lys 


Pro 


Gly 


385 










390 










395 










400 


Val 


Tyr 


Thr 


Arg 


Val 


Thr 


Ala 


Tyr 


Leu 


Asp 


Trp 


He 


Arg 


Gin 


Gin 


Thr 



405 410 415 



Gly He 



<210> 67 

<211> 1783 

<212> DNA 

<213> Homo Sapien 

<220> 
<221> CDS 

<222> (246) . . . (1499) 

<223> Nucleotide sequence encoding human heps in 
<300> 

<308> GenBank M18930 
<309> 1993-06-11 

<400> 67 

tcgagcccgc tttccaggga ccctacctga gggcccacag gtgaggcagc ctggcctagc 60 

aggccccacg ccaccgcctc tgcctccagg ccgcccgctg ctgcggggcc accatgctcc 120 

tgcccaggcc tggagactga cccgaccccg gcactacctc gaggctccgc ccccacctgc 180 

tggaccccag ggtcccaccc tggcccagga ggtcagccag ggaatcatta acaagaggca 240 
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gtgac atg gcg cag aag gag ggt ggc egg act gtg cca tgc tgc tec aga 290 
Met Ala Gin Lys Glu Gly Gly Arg Thr Val Pro Cys Cys Ser Arg 
1 5 10 15 

ccc aag gtg- gca get etc act gcg ggg ace ctg eta ctt ctg aca gee 338 
Pro Lys Val Ala Ala Leu Thr Ala Gly Thr Leu Leu Leu Leu Thr Ala 

20 25 30 

ate ggg gcg gca tec tgg gec att gtg get gtt etc etc agg agt gac 3 86 

He Gly Ala Ala Ser Trp Ala He Val Ala Val Leu Leu Arg Ser Asp 

35 40 45 

cag gag ccg ctg tac cca gtg cag gtc age tct gcg gac get egg etc 434 
Gin Glu Pro Leu Tyr Pro Val Gin Val Ser Ser Ala Asp Ala Arg Leu 
50 55 60 

atg gtc ttt gac aag acg gaa ggg acg tgg egg ctg ctg tgc tec teg 482 
Met Val Phe Asp Lys Thr Glu Gly Thr Trp Arg Leu Leu Cys Ser Ser 
65 70 75 

cgc tec aac gee agg gta gec gga etc age tgc gag gag atg ggc ttc 530 
Arg Ser Asn Ala Arg Val Ala Gly Leu Ser Cys Glu Glu Met Gly Phe 
80 85 90 95 

etc agg gca ctg ace cac tec gag ctg gac gtg cga acg gcg ggc gec 578 
Leu Arg Ala Leu Thr His Ser Glu Leu Asp Val Arg Thr Ala Gly Ala 

100 105 110 

aat ggc acg teg ggc ttc ttc tgt gtg gac gag ggg agg ctg ccc cac 626 
Asn Gly Thr Ser Gly Phe Phe Cys Val Asp Glu Gly Arg Leu Pro His 

115 120 125 

ace cag agg ctg ctg gag gtc ate tec gtg tgt gat tgc ccc aga ggc 674 
Thr Gin Arg Leu Leu Glu Val He Ser Val Cys Asp Cys Pro Arg Gly 
130 135 140 

cgt ttc ttg gee gee ate tgc caa gac tgt ggc cgc agg aag ctg ccc 722 
Arg Phe Leu Ala Ala lie Cys Gin Asp Cys Gly Arg Arg Lys Leu Pro 
145 150 155 

gtg gac cgc ate gtg gga ggc egg gac ace age ttg ggc egg tgg ccg 770 
Val Asp Arg He Val Gly Gly Arg Asp Thr Ser Leu Gly Arg Trp Pro 
160 165 170 175 

tgg caa gtc age ctt cgc tat gat gga gca cac etc tgt ggg gga tec 818 
Trp Gin Val Ser Leu Arg Tyr Asp Gly Ala His Leu Cys Gly Gly Ser 

180 185 190 



ctg etc tec ggg gac tgg gtg ctg aca gee gee cac tgc ttc ccg gag 
Leu Leu Ser Gly Asp Trp Val Leu Thr Ala Ala His Cys Phe Pro Glu 

195 200 205 



866 



egg aac egg gtc ctg tec cga tgg cga gtg ttt gee ggt gee gtg gee 914 
Arg Asn Arg Val Leu Ser Arg Trp Arg Val Phe Ala Gly Ala Val Ala 
210 215 220 

cag gec tct ccc cac ggt ctg cag ctg ggg gtg cag get gtg gtc tac 962 
Gin Ala Ser Pro His Gly Leu Gin Leu Gly Val Gin Ala Val Val Tyr 
225 230 235 

cac ggg ggc tat ctt ccc ttt egg gac ccc aac age gag gag aac age 1010 
His Gly Gly Tyr Leu Pro Phe Arg Asp Pro Asn Ser Glu Glu Asn Ser 
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240 245 250 255 

aac gat att gcc ctg gtc cac etc tec agt ccc ctg ccc etc aca gaa 1058 
Asn Asp He Ala Leu Val His Leu Ser Ser Pro Leu Pro Leu Thr Glu 

260 265 270 

tac ate cag cct gtg tgc etc cca get gcc ggc cag gcc ctg gtg gat 1106 
Tyr He Gin Pro Val Cys Leu Pro Ala Ala Gly Gin Ala Leu Val Asp 

275 280 285 

ggc aag ate tgt acc gtg acg ggc tgg ggc aac acg cag tac tat ggc 1154 
Gly Lys He Cys Thr Val Thr Gly Trp Gly Asn Thr Gin Tyr Tyr Gly 
290 295 300 

caa cag gcc ggg gta etc cag gag get cga gtc ccc ata ate age aat 1202 
Gin Gin Ala Gly Val Leu Gin Glu Ala Arg Val Pro He He Ser Asn 
305 310 315 

gat gtc tgc aat ggc get gac ttc tat gga aac cag ate aag ccc aag 1250 
Asp Val Cys Asn Gly Ala Asp Phe Tyr Gly Asn Gin He Lys Pro Lys 
320 325 330 335 

atg .ttc tgt get ggc tac ccc gag ggt ggc att gat gcc tgc cag ggc 1298 
Met Phe Cys Ala Gly Tyr Pro. Glu Gly Gly He Asp Ala Cys Gin Gly 

340 345 350 

gac age ggt ggt ccc ttt gtg tgt gag gac age ate tct egg acg cca 1346 
Asp Ser Gly Gly Pro Phe Val Cys Glu Asp Ser He Ser Arg Thr Pro 

355 360 365 

cgt tgg egg ctg tgt ggc att gtg agt tgg ggc act ggc tgt gcc ctg 1394 
Arg Trp Arg Leu Cys Gly He Val Ser Trp Gly Thr Gly Cys Ala Leu 
370 375 380 

gcc cag aag cca ggc gtc tac acc'aaa gtc agt gac ttc egg gag tgg 1442 
Ala Gin Lys Pro Gly Val Tyr Thr Lys Val Ser Asp Phe Arg Glu Trp 
385 390 395 

ate ttc cag gcc ata aag act cac tec gaa gcc age ggc atg gtg acc 1490 
He Phe Gin Ala He Lys Thr His Ser Glu Ala Ser Gly Met Val Thr 
400 405 410 415 

cag etc tga ccggtggctt ctcgctgcgc agcctccagg geccgaggtg 1539 
Gin Leu * 

atcccggtgg tgggatccac getgggcega ggatgggacg tttttcttct tgggcccggt 1599 

ccacaggtcc aaggacaccc tccctccagg gtcctctctt ccacagtggc gggcccactc 1659 

agccccgaga ccacccaacc tcaccctcct gacccccatg taaatattgt tetgetgtet 1719 

gggactcctg tetaggtgee cctgatgatg ggatgetett taaataataa agatggtttt 1779 

gatt 1783 

<210> 68 

<211> 417 

<212> PRT 

<213> Homo Sapien 

<400> 68 

Met Ala Gin Lys Glu Gly Gly Arg Thr Val Pro Cys Cys Ser Arg Pro 

15 10 15 

Lys Val Ala Ala Leu Thr Ala Gly Thr Leu Leu Leu Leu Thr Ala He 

20 25 30 
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Gly Ala Ala Ser 
35 

Glu Pro Leu Tyr 
50 

Val Phe Asp Lys 
65 

Sex Asn Ala Arg 

Arg Ala Leu Thr 

100 

Gly Thr Ser Gly 
115 

Gin Arg Leu Leu 
130 

Phe Leu Ala Ala 
145 

Asp Arg lie Val 

Gin Val Ser Leu 

180 

Leu Ser Gly Asp 
195 

Asn Arg Val Leu 
210 

Ala Ser Pro His 
225 

Gly Gly Tyr Leu 

Asp lie Ala Leu 

260 

lie Gin Pro Val 
275 

Lys lie Cys Thr 
290 

Gin Ala Gly Val 
3 05 

Val Cys Asn Gly 

Phe Cys Ala Gly 

340 

Ser Gly Gly Pro 
355 

Trp Arg Leu Cys 
370 

Gin Lys Pro Gly 
385 

Phe Gin Ala lie 
Leu 



Trp Ala lie Val 

40 

Pro Val Gin Val 
55 

Thr Glu Gly Thr 
70 

Val Ala Gly Leu 
85 

His Ser Glu Leu 

Phe Phe Cys Val 

120 

Glu Val lie Ser 
135 

lie Cys Gin Asp 
150 

Gly Gly Arg Asp 
165 

Arg Tyr Asp Gly 

Trp Val Leu Thr 

200 

Ser Arg Trp Arg 
215 

Gly Leu Gin Leu 
230 

Pro Phe Arg Asp 
245 

Val His Leu Ser 

Cys Leu Pro Ala 

280 

Val Thr Gly Trp 
295 

Leu Gin Glu Ala 
310 

Ala Asp Phe Tyr 
325 

Tyr Pro Glu Gly 

Phe Val Cys Glu 

360 

Gly He Val Ser 
375 

Val Tyr Thr Lys 
390 

Lys Thr His Ser 
405 



Ala 


Val 


Leu 


Leu 




Ser 


Ala 


ASD 














T ,c»i i 


Leu 

■i-i w U 






75 




Ser 


Cys 


Glu 


Glu 




90 








v ex. _l 




X HX 










Asp 


Glu 


Glv 




Val 


Cys 












140 


Cys 


Gly 














Thr 


Ser 


Leu 


Gly 




170 






Ala 


His 




v_> y o 


185 








Ala 


Ala 




^> jr O 


Val 


Phe 












*? o n 


Gly Val 


VJJLA1 


Ala 






9-3C 
£t O 3 




Pro 


Asn 


Ser 


Glu 




250 






Ser 


Pro 




T>T~0 

S J— SmJ 


265 








Ala 


Gly 




Ala 


Gly 


Asn 


X 11 J- 


m n 








Arg 


Val 


Pro 


He 






315 




Gly Asn 


Gin 


He 




330 






Gly 


He 


Asp 


Ala 


345 








Asp 


Ser 


He 


Ser 


Trp 


Gly 


Thr 


Gly 








380 


Val 


Ser 


Asp 


Phe 






395 




Glu 


Ala 


Ser 


Gly 




410 







Arg Ser Asp Gin 
45 

Ala Arg Leu Met 

Cys Ser Ser Arg 

80 

Met Gly Phe Leu 
95 

Ala Gly Ala Asn 
110 

Leu Pro His Thr 
125 

Pro Arg Gly Arg 

Lys Leu Pro Val 

160 

Arg Trp Pro Trp 
175 

Gly Gly Ser Leu 
190 

Phe Pro Glu Arg 
205 

Ala Val Ala Gin 

Val Val Tyr His 

240 

Glu Asn Ser Asn 
255 

Leu Thr Glu Tyr 
270 

Leu Val Asp Gly 
285 

Tyr Tyr Gly Gin 

He Ser Asn Asp 

320 

Lys Pro Lys Met 
335 

Cys Gin Gly Asp 
350 

Arg Thr Pro Arg 
3 65 

Cys Ala Leu Ala 

Arg Glu Trp He 

400 

Met Val Thr Gin 
415 



<210> 69 

<211> 2479 

<212> DNA 

<213> Homo sapien 

<220> 
<221> CDS 

<222> (57) . . . (1535) 

<223> Nucleotide sequence encoding human serine protease 



<300> 
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<308> GenBank U75329 
<309> 1997-10-10 

<400> 69 

gtcatattga acattccaga tacctatcat tactcgatgc tgttgataac agcaag atg 59 

Met 
1 

get ttg aac tea ggg tea cca cca get att gga cct tac tat gaa aac 107 
Ala Leu Asn Ser Gly Ser Pro Pro Ala lie Gly Pro Tyr Tyr Glu Asn 

5 10 is 

cat gga tac caa ccg gaa aac ccc tat ccc gca cag ccc act gtg gtc 155 
His Gly Tyr Gin Pro Glu Asn Pro Tyr Pro Ala Gin Pro Thr Val Val 
20 25 30 

ccc act gtc tac gag gtg cat ccg get cag tac tac ccg tec ccc gtg 203 
Pro Thr Val Tyr Glu Val His Pro Ala Gin Tyr Tyr Pro Ser Pro Val 
35 40 45 

ccc cag tac gee ccg agg gtc ctg acg cag get tec aac ccc gtc gtc 251 
Pro Gin Tyr Ala Pro Arg Val Leu Thr Gin Ala Ser Asn Pro Val Val 
50 55 60 65 

tgc acg cag ccc aaa tec cca tec ggg aca gtg tgc ace tea aag act 299 
Cys Thr Gin Pro Lys Ser Pro Ser Gly Thr Val Cys Thr Ser Lys Thr 

70 75 80 

aag aaa gca ctg tgc ate ace ttg acc ctg ggg ace ttc etc gtg gga 347 
Lys Lys Ala Leu Cys lie Thr Leu Thr Leu Gly Thr Phe Leu Val Gly 

85 90 95 

get gcg ctg gee get ggc eta etc tgg aag ttc atg ggc age aag tgc 395 
Ala Ala Leu Ala Ala Gly Leu Leu Trp Lys Phe Met Gly Ser Lys Cys 
100 105 110 

tec aac tct ggg ata gag tgc gac tec tea ggt acc tgc ate aac ccc 443 
Ser Asn Ser Gly He Glu Cys Asp Ser Ser Gly Thr Cys He Asn Pro 
115 120 125 

tct aac tgg tgt gat ggc gtg tea cac tgc ccc ggc ggg gag gac gag 491 
Ser Asn Trp Cys Asp Gly Val Ser His Cys Pro Gly Gly Glu Asp Glu 
130 135 140 145 

aat egg tgt gtt cgc etc tac gga cca aac ttc ate ctt cag atg tac 539 
Asn Arg Cys Val Arg Leu Tyr Gly Pro Asn Phe He Leu Gin Met Tyr 

150 155 160 

tea tct cag agg aag tec tgg cac cct gtg tgc caa gac gac tgg aac 587 
Ser Ser Gin Arg Lys Ser Trp His Pro Val Cys Gin Asp Asp Trp Asn 

165 170 175 

gag aac tac ggg egg gcg gec tgc agg gac atg ggc tat aag aat aat 635 
Glu Asn Tyr Gly Arg Ala Ala Cys Arg Asp Met Gly Tyr Lys Asn Asn 
180 185 190 

ttt tac tct age caa gga ata gtg gat gac age gga tec acc age ttt 683 
Phe Tyr Ser Ser Gin Gly lie Val Asp Asp Ser Gly Ser Thr Ser Phe 
195 200 205 

atg aaa ctg aac aca agt gee ggc aat gtc gat ate tat aaa aaa ctg 731 
Met Lys Leu Asn Thr Ser Ala Gly Asn Val Asp He Tyr Lys Lys Leu 



WO 01/57194 



61/66 



PCT/US01/03471 



210 

tac cac agt 
Tyr His Ser 



tta gcc tgc 
Leu Ala Cys 



ggc ggt gag 
Gly Gly Glu 
260 

cac gtc cag 
His Val Gin 
275 

tgg ate gtg 
Trp lie Val 
290 

tgg cat tgg 
Trp His Trp 



tat gga gcc 
Tyr Gly Ala 



gac tec aag 
Asp Ser Lys 
340 

cct ctg act 
Pro Leu Thr 
355 

ggc atg atg 
Gly Met Met 
370 

gcc acc gag 
Ala Thr Glu 



gtg ctt etc 
Val Leu Leu 



aac ctg ate 
Asn Leu Xle 
420 

gtc gat tct 
Val Asp Ser 
435 

aac aat ate 
Asn Asn lie 
450 



215 

gat gcc tgt 
Asp Ala Cys 
230 

ggg gtc aac 

Gly Val Asn 
245 

age gcg etc 
Ser Ala Leu 



aac gtc cac 
Asn Val His 



aca gcc gcc 
Thr Ala Ala 
295 

acg gca ttt 
Thr Ala Phe 
310 

gga tac caa 
Gly Tyr Gin 
325 

acc aag aac 
Thr Lys Asn 



ttc aac gac 
Phe Asn Asp 



ctg cag cca 
Leu Gin Pro 
375 

gag aaa ggg 
Glu Lys Gly 
390 

att gag aca 
lie Glu Thr 
405 

aca cca gcc 
Thr Pro Ala 



tgc cag ggt 
Cys Gin Gly 



tgg tgg ctg 
Trp Trp Leu 
455 



tct tea aaa 
Ser Ser Lys 



ttg aac tea 
Leu Asn Ser 
250 

ccg ggg gcc 
Pro Gly Ala 
265 

gtg tgc gga 
Val Cys Gly 
280 

cac tgc gtg 
His Cys Val 



gcg ggg att 
Ala Gly lie 



gta caa aaa 
Val Gin Lys 
330 

aat gac att 
Asn Asp lie 
345 

eta gtg aaa 
Leu Val Lys 
360 

gaa cag etc 
Glu Gin Leu 



aag acc tea 
Lys Thr Ser 



cag aga tgc 
Gin Arg Cys 
410 

atg ate tgt 
Met lie Cys 
425 

gac agt gga 
Asp Ser Gly 
440 

ata ggg gat 
lie Gly Asp 



220 

gca gtg gtt 
Ala Val Val 
235 

age cgc cag 
Ser Arg Gin 



tgg ccc tgg 
Trp Pro Trp 



ggc tec ate 
Gly Ser lie 
285 

gaa aaa cct 
Glu Lys Pro 
300 

ttg aga caa 
Leu Arg Gin 
315 

gtg att tct 
Val lie Ser 



gcg ctg atg 
Ala Leu Met 



cca gtg tgt 
Pro Val Cys 
365 

tgc tgg att 
Cys Trp lie 
380 

gaa gtg ctg 
Glu Val Leu 
395 

aac age aga 
Asn Ser Arg 



gcc ggc ttc 
Ala Gly Phe 



ggg cct ctg 
Gly Pro Leu 
445 

aca age tgg 
Thr Ser Trp 
460 



tct tta cgc 
Ser Leu Arg 
240 

age agg ate 
Ser Arg lie 
255 

cag gtc age 
Gin Val Ser 
270 

ate acc ccc 
lie Thr Pro 



ctt aac aat 
Leu Asn Asn 



tct ttc atg 
Ser Phe Met 
320 

cat cca aat 
His Pro Asn 
335 

aag ctg cag 
Lys Leu Gin 
350 

ctg ccc aac 
Leu Pro Asn 



tec ggg tgg 
Ser Gly Trp 



aac get gcc 
Asn Ala Ala 
400 

tat gtc tat 
Tyr Val Tyr 
415 

ctg cag ggg 
Leu Gin Gly 
430 

gtc act teg 
Val Thr Ser 



ggt tct ggc 
Gly Ser Gly 



225 

tgt 779 
Cys 



gtg 827 
Val 



ctg 875 
Leu 



gag 923 
Glu 



cca 971 

Pro 

305 

ttc 1019 
Phe 



tat 1067 
Tyr 



aag 1115 
Lys 



cca 1163 
Pro 



ggg 1211 

Gly 

385 

aag 1259 
Lys 



gac 1307 
Asp 



aac 1355 
Asn 



aac 1403 
Asn 



tgt 1451 

Cys 

465 
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gcc aaa get tac aga cca gga gtg tac ggg aat gtg atg gta ttc acg 1499 
Ala Lye Ala Tyr Arg Pro Gly Val Tyr Gly Asn Val Met Val Phe Thr 

470 475 480 

gac tgg att tat cga caa atg aag gca aac ggc taa tccacatggt 1545 
Asp Trp lie Tyr Arg Gin Met Lys Ala Asn Gly * 

485 490 

S^?? tC ? t ^ P c 9 tc g tt: t tacaagaaaa caatggggct ggttttgctt ccccgtgcat 1605 

gatttactct tagagatgat tcagaggtca cttcattttt attaaacagt gaacttgtct 1665 

ggctttggca ctctctgcca tactgtgcag gctgcagtgg ctcccctgcc cagcctgctc 1725 

tccctaaccc cttgtccgca aggggtgatg gccggctggt tgtgggcact ggcggtcaat 1785 

tgtggaagga agagggttgg aggctgcccc cattgagatc ttcctgctga gtcctttcca 1845 

ggggecaatt ttggatgagc atggagctgt cacttctcag ctgctggatg acttgagatg 1905 

aaaaaggaga gacatggaaa gggagacagc caggtggcac ctgcagcggc tgccctctgg 1965 

ggccacttgg tagtgtcccc agcctacttc acaaggggat tttgctgatg ggttcttaga 2025 

gecttagcag ccctggatgg tggccagaaa taaagggacc agcccttcat gggtggtgac 2085 

gtggtagtca cttgtaaggg gaacagaaac atttttgttc ttatggggtg agaatataga 2145 

cagtgccctt ggtgcgaggg aagcaattga aaaggaactt gccctgagca ctcctqgtgc 2205 

aggtctccac ctgcacattg ggtggggctc ctgggaggga gactcagcct tcctcctcat 2265 

cctccctgac cctgctccta gcaccctgga gagtgaatgc cccttggtcc ctggcagggc 2325 

gccaagtttg gcaccatgtc ggcctcttca ggectgatag tcattggaaa ttgaggtcca 2385 

tgggggaaat caaggatget cagtttaagg tacactgttt ccatgttatg tttctacaca 2445 
ttgatggtgg tgaccctgag ttcaaageca tctt 



<210> 70 

<211> 492 

<212> PRT 

<213> Homo sapien 

<400> 70 

Met Ala Leu Asn Ser Gly Ser Pro Pro Ala lie Gly Pro Tyr Tyr Glu 

» X 5 10 15 

Asn Has Gly Tyr Gin Pro Glu Asn Pro Tyr Pro Ala Gin Pro Thr Val 

20 25 30 

Val Pro Thr Val Tyr Glu Val His Pro Ala Gin Tyr Tyr Pro Ser Pro 

35 40 45 

Val Pro Gin Tyr Ala Pro Arg Val Leu Thr Gin Ala Ser Asn Pro Val 

50 55 60 

Val Cys Thr Gin Pro Lys Ser Pro Ser Gly Thr Val Cys Thr Ser Lvs 

65 70 75 80 

Thr Lys Lys Ala Leu Cys lie Thr Leu Thr Leu Gly Thr Phe Leu Val 

85 90 95 

Gly Ala Ala Leu Ala Ala Gly Leu Leu Trp Lys Phe Met Gly Ser Lvs 
^ 100 105 no 

Cys Ser Asn Ser Gly He Glu Cys Asp Ser Ser Gly Thr Cys He Asn 

115 120 125 

Pro Ser Asn Trp Cys Asp Gly Val Ser His Cys Pro Gly Gly Glu Asp 

130 135 140 

Glu Asn Arg Cys Val Arg Leu Tyr Gly Pro Asn Phe He Leu Gin Met 

150 155 160 

Tyr Ser Ser Gin Arg Lys Ser Trp His Pro Val Cys Gin Asp Asp Trp 

165 170 175 

Asn Glu Asn Tyr Gly Arg Ala Ala Cys Arg Asp Met Gly Tyr Lys Asn 

180 185 190 

Asn Phe Tyr Ser Ser Gin Gly He Val Asp Asp Ser Gly Ser Thr Ser 

195 200 205 

Phe Met Lys Leu Asn Thr Ser Ala Gly Asn Val Asp He Tyr Lys Lys 

210 215 220 

Leu Tyr His Ser Asp Ala Cys Ser Ser Lys Ala Val Val Ser Leu Arq 

£ t , 230 235 240 

Cys Leu Ala Cys Gly Val Asn Leu Asn Ser Ser Arg Gin Ser Arg He 



2479 
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245 250 255 



Val 


Gly 


Gly Glu 


Ser 


Ala 


Leu 


Pro 


Gly 


Ala 


Trp 


Pro 


Trp 


Gin 


Val 


ber 








260 










o zt cr 
2 6b 










270 






Leu 


Has 


Val 


Gin 


Asn 


Val 


HIS 


Val 


Cys 


Gly Gly 


Ser 


I J.e 


He 


Thr 


Pro 






275 










280 










285 








Glu 


Trp 


He 


Val 


Thr 


Ala 


Ala 


His 


Cys 


Val 


Glu 


Lys 


Pro 


Leu 


Asn 


Asn 




290 










*i ft c 

295 










300 










Pro 


Trp 


His 


Trp 


■Hi 

Thr 


Ala 


Phe 


Ala 


Gly 


He 


Leu 


Arg 


Gin 


Ser 


Phe 


Met 


305 








Tift 

310 










315 










-i U 


Phe 


Tyr 


Gly Ala 


Gly 


Tyr 


Gin 


Val 


Gin 


Lys 


Val 


He 


Ser 


His 


Pro 


Asn 










*i ft c 

325 










330 










335 




Tyx 


Asp 


Ser 


Lys 


Thr 


Lys 


Asn 


Asn 


Asp 


He 


Ala 


Leu 


Met 


Lys 


Leu 


Gin 




340 




















350 






Lys 


Pro 


Leu 


Thr 


Phe 


Asn 


Asp 


Leu 


Val 


Lys 


Pro 


Val 


Cys 


Leu 


Pro 


Asn 




355 










JDU 










365 








Pro 


Gly 


Met 


Met 


Leu 


Gin 


Pro 


Glu 


Gin 


Leu 


Cys 


Trp 


He 


Ser Gly 


Trp 




370 










375 










380 










Gly 


Ala 


Thr 


Glu 


Glu 


Lys 


Gly 


Lys 


Thr 


Ser 


Glu 


Val 


Leu 


Asn 


Ala 


Ala 


385 










390 










395 










400 


Lys 


Val 


Leu 


Leu 


lie 


Glu 


Thr 


Gin 


Arg 


Cys 


Asn 


Ser Arg 


Tyr 


Val 


Tyr 








405 










410 










415 




Asp 


Asn 


Leu 


lie 


Thr 


Pro 


Ala 


Met 


He 


Cys 


Ala 


Gly 


Phe 


Leu 


Gin 


Gly 






420 










425 










430 






Asn 


Val 


Asp 


Ser 


Cys 


Gin 


Gly 


Asp 


Ser 


Gly Gly 


Pro 


Leu 


Val 


Thr 


Ser 






435 










440 










445 








Asn 


Asn 


Asn 


He 


Trp 


Trp 


Leu 


lie 


Gly 


Asp 


Thr 


Ser 


Trp 


Gly 


Ser 


Gly 




450 










455 










460 










Cys 


Ala 


Lys 


Ala 


Tyr 


Arg 


Pro 


Gly 


Val 


Tyr 


Gly 


Asn 


Val 


Met 


Val 


Phe 


465 










470 










475 










480 


Thr 


Asp 


Trp 


lie 


Tyr 


Arg 


Gin 


Met 


Lys 


Ala 


Asn 


Gly 











485 490 



<210> 71 

<211> 2079 

<212> DNA 

<213> Homo sapien 

<220> 
<221> CDS 

<222> (251) . . . (1522) 

<223> Nucleotide sequence encoding transmembrane 
protease, serine 4 (TMPRSS4) 

<300> 

<308> GenBank NM016425 
<309> 2000-11-06 

<400> 71 

gagaggcagc agcttgttca gcggacaagg atgctgggcg tgagggacca aggcctgccc 60 
tgcactcggg cctcctccag ccagtgctga ccagggactt ctgacctgct ggccagccag 120 
gacctgtgtg gggaggccct cctgctgcct tggggtgaca atctcagctc caggctacag 180 
ggagaccggg aggatcacag agccagcatg gtacaggatc ctgacagtga tcaacctctg 240 
aacagcctcg atg tea aac ccc tgc gca aac ccc gta tec cca tgg aga 289 

Met Ser Asn Pro Cys Ala Asn Pro Val Ser Pro Trp Arg 
15 10 

cct tea gaa agt gtg ggg ate ccc ate ate ata gca eta ctg age ctg 337 
Pro Ser Glu Ser Val Gly He Pro He He He Ala Leu Leu Ser Leu 
15 20 25 

gcg agt ate ate att gtg gtt gtc etc ate aag gtg att ctg gat aaa 385 
Ala Ser He He He Val Val Val Leu He Lys Val He Leu Asp Lys 
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30 35 40 45 

tac tac ttc etc tgc ggg cag cct etc cac ttc ate ccg agg aag cag 433 

Tyr Tyr Phe Leu Cys Gly Gin Pro Leu His Phe lie Pro Arg Lys Gin 

50 55 60 

ctg tgt gac gga gag ctg gac tgt ccc ttg ggg gag gac gag gag cac 481 

Leu Cys Asp Gly Glu Leu Asp Cys Pro Leu Gly Glu Asp Glu Glu His 

65 70 75 

tgt gtc aag age ttc ccc gaa ggg cct gca gtg gca gtc cgc etc tec 529 

Cys Val Lys Ser Phe Pro Glu Gly Pro Ala Val Ala Val Arg Leu .Ser 

80 85 90 

aag gac cga tec aca ctg cag gtg ctg gac teg gee aca ggg aac tgg 577 

Lys Asp Arg Ser Thr Leu Gin Val Leu Asp Ser Ala Thr Gly Asn Trp 

95 100 105 

ttc tct gee tgt ttc gac aac ttc aca gaa get etc get gag aca gee 625 

Phe Ser Ala Cys Phe Asp Asn Phe Thr Glu Ala Leu Ala Glu Thr Ala 

110 115 120 125 

tgt agg cag atg ggc tac age age aaa ccc act ttc aga get gtg gag 673 

Cys Arg Gin Met Gly Tyr Ser Ser Lys Pro Thr Phe Arg Ala Val Glu 

130 135 140 

att ggc cca gac cag gat ctg gat gtt gtt gaa ate aca gaa aac age 721 

lie Gly Pro Asp Gin Asp Leu Asp Val Val Glu lie Thr Glu Asn Ser 

145 150 155 

cag gag ctt cgc atg egg aac tea agt ggg ccc tgt etc tea ggc tec 769 

Gin Glu Leu Arg Met Arg Asn Ser Ser Gly Pro Cys Leu Ser Gly Ser 

160 165 170 

ctg gtc tec ctg cac tgt ctt gec tgt ggg aag age ctg aag acc ccc 817 

Leu Val Ser Leu His Cys Leu Ala Cys Gly Lys Ser Leu Lys Thr Pro 

175 180 185 

cgt gtg gtg ggt ggg gag gag gee tct gtg gat tct tgg cct tgg cag 865 

Arg Val Val Gly Gly Glu Glu Ala Ser Val Asp Ser Trp Pro Trp Gin 

190 195 200 205 

gtc age ate cag tac gac aaa cag cac gtc tgt gga ggg age ate ctg 913 

Val Ser lie Gin Tyr Asp Lys Gin His Val Cys Gly Gly Ser lie Leu 

210 215 220 

gac ccc cac tgg gtc etc acg gca gee cac tgc ttc agg aaa cat acc 961 

Asp Pro His Trp Val Leu Thr Ala Ala His Cys Phe Arg Lys His Thr 

225 230 235 

gat gtg ttc aac tgg aag gtg egg gca ggc tea gac aaa ctg ggc age 1009 

Asp Val Phe Asn Trp Lys Val Arg Ala Gly Ser Asp Lys Leu Gly Ser 

240 245 250 

ttc cca tec ctg get gtg gee aag ate ate ate att gaa ttc aac ccc 1057 

Phe Pro Ser Leu Ala Val Ala Lys lie lie lie lie Glu Phe Asn Pro 

255 260 265 

atg tac ccc aaa gac aat gac ate gee etc atg aag ctg cag ttc cca 1105 

Met Tyr Pro Lys Asp Asn Asp lie Ala Leu Met Lys Leu Gin Phe Pro 

270 275 280 285 
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etc act ttc tea ggc aca gtc agg ccc ate tgt ctg ccc ttc ttt gat 1153 

Leu Thr Phe Ser Gly Thr Val Arg Pro lie Cys Leu Pro Phe Phe Asp 

290 295 300 

gag gag etc act cca gec acc cca etc tgg ate att gga tgg ggc ttt 1201 

Glu Glu Leu Thr Pro Ala Thr Pro Leu Trp lie lie Gly Trp Gly Phe 

305 310 315 

acg aag cag aat gga ggg aag atg tct gac ata ctg ctg cag gcg tea 1249 

Thr Lys Gin Asn Gly Gly Lys Met Ser Asp He Leu Leu Gin Ala Ser 
320 325 330 

gtc cag gtc att gac age aca egg tgc aat gca gac gat gcg tac cag 1297 

Val Gin Val He Asp Ser Thr Arg Cys Asn Ala Asp Asp Ala Tyr Gin 
335 340 345 

ggg gaa gtc acc gag aag atg atg tgt gca ggc ate ccg gaa ggg ggt 1345 

Gly Glu Val Thr Glu LyB Met Met Cys Ala Gly He Pro Glu Gly Gly 
350 355 360 365 

gtg gac acc tgc cag ggt gac agt ggt ggg ccc ctg atg tac caa tct 1393 

Val Asp Thr Cys Gin Gly Asp Ser Gly Gly Pro Leu Met Tyr Gin Ser 

370 375 380 

gac cag tgg cat gtg gtg ggc ate gtt age tgg ggc tat ggc tgc ggg 1441 
Asp Gin Trp His Val Val Gly He Val Ser Trp Gly Tyr Gly Cys Gly 

385 390 395 



ggc ccg age acc cca gga gta tac acc aag gtc tea gec tat etc aac 
Gly Pro Ser Thr Pro Gly Val Tyr Thr Lys Val Ser Ala Tyr Leu Asn 
400 405 410 



1489 



tgg ate tac aat gtc tgg aag get gag ctg taa tgctgctgcc ectttgeagt 1542 
Trp He Tyr Asn Val Trp Lys Ala Glu Leu * 
415 420 

getgggagee gcttccttcc tgccctgccc acctggggat cccccaaagt cagacacaga 1602 

gcaagagtcc ccttgggtac acccctctgc ccacagcctc agcatttctt ggagcagcaa 1662 

agggectcaa ttcctgtaag agaccctcgc ageccagagg cgcccagagg aagtcagcag 1722 

ccctagctcg gccacacttg gtgctcccag catcccaggg agagacacag cccactgaac 1782 

aaggtctcag gggtattget aagccaagaa ggaactttcc cacactactg aatggaagca 1842 

ggctgtcttg taaaagecca gatcactgtg ggctggagag gagaaggaaa gggtctgege 1902 

cagccctgtc cgtcttcacc catccccaag cctactagag caagaaacca gttgtaatat 1962 

aaaatgeact gccctactgt tggtatgact accgttacct actgttgtca ttgttattac 2022 

agetatggee actattatta aagagctgtg taacatcaaa aaaaaaaaaa aaaaaaa 2079 

<210> 72 

<211> 423 

<212> PRT 

<213> Homo sapien 

<400> 72 

Met Ser Asn Pro Cys Ala Asn Pro Val Ser Pro Trp Arg Pro Ser Glu 

15 10 15 

Ser Val Gly He Pro He He He Ala Leu Leu Ser Leu Ala Ser He 

20 25 30 

He He Val Val Val Leu He Lys Val He Leu Asp Lys Tyr Tyr Phe 

35 40 45 

Leu Cys Gly Gin Pro Leu His Phe He Pro Arg Lys Gin Leu Cys Asp 

50 55 60 

Gly Glu Leu Asp Cys Pro Leu Gly Glu Asp Glu Glu His Cys Val Lys 
65 70 75 80 



WO 01/57194 



66/66 



PCT/US01/03471 



ber 


±rne 


pro 


ulU 


oer 


xxir 












100 


Cys 


Pne 


Asp 


Asn 






115 




Met 


Gly 


Tyr 


Ser 




130 






Asp 


Gin 


Asp 


Leu 


145 








Arg 


wen 


Arg 


Asn 


Leu 


His 


Cys 


Leu 








180 


Gly 


Gly 


Glu 


Glu 






135 




Gin 


Tyr 


Asp 


Lys 




210 






Trp 


Val 


Leu 


Thr 


225 








Asn 


Trp 


Lys 


Val 


Leu 


Ala 


Val 


Ala 








260 


Lys 


Asp 


Asn 


Asp 






275 




Ser 


Gly 


Thr 


Val 




290 






Thr 


Pro 


Ala 


Thr 


305 








Asn 


Gly 


Gly 


Lys 


lie 


Asp 


Ser 


Thr 








340 


Thr 


Glu 


Lys 


Met 






355 




Cys 


Gin 


Gly 


Asp 




370 






His 


Val 


Val 


Gly 


385 








Thr 


Pro 


Gly 


Val 


Asn 


Val 


Trp 


Lys 








420 



Giy 


Pro 


Ala 


Val 


85 








val 


Leu 


Asp 


Ser 


Pne 


Thr 


Glu 


Ala 








120 


Ser 


Lys 


Pro 


Thr 






135 




Asp 


Val 


Val 


Glu 




150 






Ser 


Ser 


Gly 


Pro 


165 








Ala 


Cys 


Gly Lys 


Ala 


Ser 


val 


nop 








200 


Gin 


His 


Val 


Cys 






215 


Ala 


Ala 


His 


Cys 




230 






Arg Ala 


Gly 


Ser 


245 








Lys 


lie 


He 


lie 


lie 


Ala 


Leu 


Met 








280 


Arg 


Pro 


lie 


Cys 






295 




Pro 


Leu 


Trp 


He 




310 






Met 


Ser 


Asp 


He 


325 








Arg 


Cys 


Asn 


Ala 


Met 


Cys 


Ala 


Gly 








360 


Ser Gly 


Gly. 


Pro 






375 




lie 


Val 


Ser 


Trp 




390 






Tyr 


Thr 


Lys 


Val 


405 








Ala 


Glu 


Leu 





Ala Val Arg Leu 
90 

Ala Thr Gly Asn 
105 

Leu Ala Glu Thr 

Phe Arg Ala Val 

140 

He Thr Glu Asn 
155 

Cys Leu Ser Gly 
170 

Ser Leu Lys Thr 
185 

Ser Trp Pro Trp 

Gly Gly Ser He 

220 

Phe Arg Lys His 
235 

Asp Lys Leu Gly 
250 

He Glu Phe Asn 
265 

Lys Leu Gin Phe 

Leu Pro Phe Phe 

300 

He Gly Trp Gly 
315 

Leu Leu Gin Ala 
330 

Asp Asp Ala Tyr 
345 

He Pro ;Glu Gly 

Leu Met Tyr Gin 

380 

Gly Tyr Gly Cys 
395 

Ser Ala Tyr Leu 
410 



Ser Lys Asp Arg 
95 

Trp Phe Ser Ala 
110 

Ala Cys Arg Gin 
125 

Glu He Gly Pro 

Ser Gin Glu Leu 

160 

Ser Leu Val Ser 
175 

Pro Arg Val Val 
190 

Gin Val Ser He 
205 

Leu Asp Pro His 

Thr Asp Val Phe 

240 

Ser Phe Pro Ser 
255 

Pro Met Tyr Pro 
270 

Pro Leu Thr Phe 
285 

Asp Glu Glu Leu 

Phe Thr Lys Gin 

320 

Ser Val Gin Val 
335 

Gin Gly Glu Val 
350 

Gly Val Asp Thr 
365 

Ser Asp Gin Trp 

Gly Gly Pro Ser 

400 

Asn Trp He Tyr 
415 



